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(57) Abstract 

A method of inhibiting gene expression is 
described. The method, which affects enzymatic 
activity in a plant, comprises expressing in a plant 
(or a cell, a tissue or an organ thereof) a nucleotide 
sequence wherein the nucleotide sequence codes, 
partially or completely, for an intron in an antisense 
orienution of a class A SBE; and wherein the 
nucietoide sequence does not contain a sequence 
that is antisense to an exon sequence normally 
associated with the intron. * 
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ANTISENSE INHtON INHmiTION OF STARCH BRANCHING ENZYME EXPRESSION 

The present invention relates to a method of inlubiting gene expression* particularly 
inhibiting gene expression in a plant. The present invention also relates to a nucleotide 

5 sequence useful in the method. In addition, the present invention relates to a promoter 
that is useful for expressing the nucleotide sequence. 

Starch is one of the main storage carbohydrases in plants, especially higher plants. 
The structure of starch consists of amylose and amylopectin. Amylose consists essemially 
of straight chains of a-l-4-linked glycosyl residues. Amylopectin comprises chains of 

10 a-M-linked glycosyl residues with some a-l-6 branches. The branched namre of 
amylopectin is accomplished by the action of inter alia an enzyme commonly known as 
the starch branching enzyme ('*SBE**). SBE catalyses the fonmation of branch points in 
. the amylopectin molecule by adding a-1,4 glucans through a-l,6-glucosidic branching 
linkages. The biosynthesis of amylose and amylopecun is schematically shown in Figtire 

IS I, whereas the a-l-4-Iinks ami the a-1-6 links are shown in Figure 2. 

In Potato, it is known that two classes of SBE exist. In our copending 
international patent applications PCT/EP96/03052 and PCT/EP96/03053, class B potato 
SBE and a gene encoding it are discussed. In international patent application 
W096/34968, class A potato SBE and a cDNA encoding it are disclosed. 

20 It is known that starch is an importam raw material. Starch is widely tised in the 

food, paper, and chemical industries. However, a large fraction of the starches used in 
these industrial applications are post-harvest modified by chemical, physical or enzymatic 
methods in order to obtain starches with certain required fimaional properties. 

Withm the past few years it has become desirable to make genetically modified 

25 plants which could be capable of producing modified starches which could be the same as 
the post-harvest modified starches. It is also known that it may be possible to prepare 
such genetically modified plants by expression of antisense nucleotide coding sequences. 
In this regard, June Bourque provides a detailed summaiy of antisense strategies for the 
genetic manipulations in plants (Bourque 1995 Plant Science IQ2 pp 12S-149). At this 

30 stage, reference could be made to Figure 3 whidi is a schematic diagram of one of the 
proposed mechanisms of antisense-RNA inhibition. 
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In particular. WO 92/11375 reports on a method of genetically modifying potato 
so as to form amylose-type starch. The method involves the use of an anti-sense 
construct diat can apparently inhibit, to a vaiying extent, the expression of the gene 
coding for fonnation of the branching enzyme in potato. The antisense construct of WO 
5 92/1 1375 consists of a tuber specific promoter, a transcription start sequence and the first 
exon of the branching enzyme in antisense direction. However, WO 92/11375 does not 
provide any antisense sequence data. In addition. WO 92/11375 only discloses the use of 
the potato GBSS promoter. 

WO 92/14827 reports on a plasmid that, after insertion into the genome of a plant, 
10 can apparently cause changes in the carbohydrate concentration and caibohydrate 
composition, such as the concentration and composition of amylose and amylopectin. in 
the regenerated plant. The plasmid contains part of the coding sequence of a branching 
enzyme in an antisense orientation. 

EP-A-0647715 reports on the use of antisense endogenous mRNA coding DNA to 
15 alter the characteristics and the metabolic pathways of ornamental plants. 

EP-A-0467349 reports on the expression of sequences that are antisense to 
sequences upstream of a promoter to control gene expression. 

EP-A-0458367 and US-A-5 107065 report on the expression of a nucleotide 
sequence to regulate gene expression in a plant. The nucleotide sequence is 
20 complementary to a mRNA sequence of a gene and may cover all or a portion of the non- 
coding region of the gene. In other words, the nucleotide sequences of EP-A-0458367 
and US-A-5 107065 must at least comprise a sequence that is complementary to a coding 
region. EP-A-0458367 and US-A-5 107065 contain minimal sequence information. 

W096/34968 discusses the use of antisense sequences conplementary to 
25 sequences which encode class A and class B potato SBE to downregulate SBE expression 
in potato plants. The sequences used are complementary to SBE coding sequences. 

Kuipers et al in Mol. Gen. Genet. [1995] 246 745-755 report on the expression of 
a series of nucleotides that are antisense to part of the genomic intron sequences of potato 
granule bound starch synthetase. Here the antisense intron sequences are attached to a 
30 part of the antisense exon sequences - wherein the intron sequences and the exon 
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sequences are naturally associated with each other. In addition, the expressed antisense 
intron sequences are at most 231 bp in length. 

Likewise, Kull et al in J. Genet & Breed. I199S] 4S 69-76 lepon on the 
expression of a series of nucleotides that are antisense to part of the genomic intron 

5 sequences of potato granule bound staich synthetase. Likewise, here the antisense intron 
sequences are attached to a part of the antisense exon sequences • wherein the intron 
sequences and the exon sequences are naturally associated with each other. In addition, 
likewise, the expressed antisense intron sequences are at most 231 bp in length. 

Shimada et al in Theor, Appl. Genet. [1993] Sfi 665-672 report on the expression 

10 of a series of nucleotides that are antisense to part of the genomic intron sequences of rice 
granule bound starch synthetase. Here the anusense intron sequences are attached to a 
pan of the antisense exon sequences - wherein the intron sequences and the exon 
sequences are naturally associated with each other. In addiuon, the expressed antisense 
intron sequences are less than 330 bp in length. 

15 Reviews on how enzymatic activity can be affected by expression of particular 

nucleotide sequences niay be found in the teachings of Finnegan and McElroy [1994] 
Biotechnology 12 883-888; and Matzke and Matzke [1995] TIG 11 1-3. 

Whilst it is known that enzymatic activity can be affected by expression of 
particular nucleotide sequences there is still a need for a method that can more reliably 

20 and/or more efficiently and/or more specifically affect enzymatic activity. 

According to a first aspect of the present invention there is provided a method of 
affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereoO comprisii^ 
expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence 
wherein the nucleotide sequence partially or completely codes for (is) an intron of the 

25 potato class A SBE gene in an antisense orientation optionally together with a nucleotide 
sequence which codes, partially or completely, for an intron of a class B starch branchii^ 
enzyme in an antisense or sense orientation; and wherein the nucleodde sequence does not 
contain a sequence that is antisense to an exon sequence normally associated with the 
intron. 

30 According to a second aspect of the present invention there is provided a method 

of affecting enzymatic activity in a starch producing organism (or a cell, a tissue or an 
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organ thereof) comprising expressing in the stardi producing organism (or a cell, a tissue 
or an organ thereof) a nucleotide sequence wherek the nucleotide sequence codes, 
partially or completely, for an intron of the potato class A SBE gene, in an aruisense 
orientation optionally together with a nucleotide sequence which codes, partially or 

5 completely, for an intron of a class B starch branching enzyme in an antisense or sense 
orientation; and wherein starch branching enzyme activity is affected and/or the levels of 
amylopectin are affected and/or the con^)osition of starch is changed. 

Preferably, the class A SBE gene antisense intron construct is used in combination 
with a potato class B SBE gene antisense intion construct as defined in PCT/EP96/030S2. 

10 However, it may also be used independently thereof, to target class A SBE alone, or in 
combination with other transgenes, to further manipulate starch quality in potato plants. 

According to a third aspea of the present invention, therefore, there is provided 
an antisense sequence comprising the nucleotide sequence shown as any one of SEQ.I.D. 
No. 15 to SEQ.I.D. No. 27 and the complement of SEQ. ID. No.38, or a variant. 

IS derivative or homologue thereof. 

According to a fourth aspect of the present invention there is provided a promoter 
comprising the sequence shown as SEQ.LD. No. 14 or a variant, derivative or 
homologue thereof. 

According to a fifth aspea of the present invention there is provided a construa 
20 capable of comprising or expressing the present invention. 

According to a sixth aspect of the present invention there is provided a vector 
comprising or expressing the present invention. 

According to a seventh aspect of the present invention there is provided a cell, 
tissue or organ comprising or expressing the present invention. 
25 According to an eighth aspect of the present invention there is provided a 

transgenic starch producing organism comprising or expressing the present invention. 

According to a ninth aspect of the present invention there is provided a starch 
obtained from the present invention. 

According to a tenth aspect of the present invention there is provided pSS17 and 

30 pSS18. 
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According to an eleventh aspect of the present invention there is provided a 
nucleotide sequence that is antisense to any one or more of the intron sequences 
obtainable from class A SBE, and especially those obtainable from intron loif class A SBE 
as set fordi in SEQ. ID. No. 38. 

5 A key advantage of the present invention is that it provides a method for prqurira 

modified starches that is not dependent on die need for post-harvest modification of 
starches. Thus the method of the present invention obviates the need for the use of 
hazardous chemicals that are normally used in the post*harvest modification of starches. 
In addition, the present invention provides imer alia genetically modified plants 

10 which are capable of producing modified and/or novel and/or improved starches whose 
properties would satisfy various industrial requirements. 

Thus, the present invention provides a method of preparing tailor-made starches in 
plants which could replace the post-harvest modified starches. 

Also, the present invention provides a method (hat enables modified starches to be 

IS prepared by a method that can have a more beneficial e%ct on the environment than the 
known post-harvest modification methods which are dependent on the use of hazardous 
chemicals and large quantities of energy. 

An other key advantage of the present invention is that it provides a method that 
may more reliably and/or more efficiendy and/or more specifically affect en^matic 

20 activity when compared to the known methods of affecting enzymatic activity. With 
regard to this advantage of the present invention it is to be noted that there is some degree 
of homology between coding regions of SBEs. However, there is little or no homology 
with the intron sequences of SBEs. 

Thus, antisense intron expression provides a mechanism to affect selectively the 

25 expression of a particular class A SBE. This advantageous aspect could be used, for 
example, to reduce or eliminate a particular SBE enzyme, especially a class A SBE 
enzyme, and replace that enzyme with anoth^ enzyme which can be another branching 
enzyme or even a recombinant version of the affected enzyme or even a hybrid enzyme 
which could for example comprise part of a SBE . enzyme from one source and at least a 

30 part of another SBE enzyme from another source. This particular feature of the present 
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invention is covered by the combination aspect of the present invention which is discussed 
in more detail later. 

Thus the present invention provides a mechanism for selectively affecting class A 
SBE activity. This is in contrast to the prior an methods which arc dependent on the use 

5 of for example antisense exon expression whereby it would not be possible to introduce 
new SBE activity without affecting that activity as well 

In the context of the present invention, class B SBE is synonymous with SBE I: 
class A SBE is synonymous with SBE II. Class A SBE is as defmed in W096/34968, 
incorporated herein by reference. Preferably, the antisense intron construct used 

10 comprises intron 1 of class A SBE, which is 2.0 kb in length and is located starting at 
residue 45 of the coding sequence of class A SBE. The boundaries of the imron may be 
calculated by searching for consensus intron boundary sequences, and are shown in 
attached figure 13. Class B SBE is substantially as defuied in the sequences given herein 
and in PCT/EP96/03052. 

15 Preferably with the first aspect of the present invention starch branching enzyme 

activity is affected and/or the levels of amylopectin are affected and/or the composition of 
starch is changed. 

Preferably with the second aspect of the present invention the nucleotide sequence 
does not contain a sequence that is antisense to an exon sequence normally associated with 
20 the intron. 

Preferably with the fourth aspect of the present invention the promoter is in 
combination with a gene of interest ("GOI"). 

Preferably the enzymatic activity is reduced or eliminated. 

Preferably the nucleotide sequence codes for at least substantially all of at least 
25 one intron in an antisense orientation. 

Preferably the nucleotide sequence codes, pardally or completely, for two or more 
introns and wherein each intron is in an anti-sense orientation. 

Preferably the nucleotide sequence comprises at least 350 nucleotides (e.g. at least 
350 bp), more preferably at least 500 nucleotides (e.g. at least 500 bp). 
30 Preferably the nucleotide sequence comprises the complement of the sequence 

shown in SEQ. ID. No. 38, or a fragment thereof. 
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Preferably the nucleotide sequence is expressed by a promoter havii^ a sequence 
shown as SEQ. LD. No 14 or a variant, derivative or homologue thereof. 

Preferably the transgenic starch producing organism is a plant. 

A preferred aspect of the present invention therefore relates to a method of 
5 affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereoO comprising 
expressing- in the plant (or a cell, a tissue or an oigan thereoO a nucleotide sequence 
wherein the nucleotide sequence codes, partially or completely, for an intron in an 
antisense orientation; wherein the nucleotide sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron; and wherein starch 
10 branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed. 

A more preferred aspea of the present invention therefore relates to a method of 
affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereoO comprising 
expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence 
IS wherein the nucleotide sequence codes, partially or completely, for an intron in an 
antisense orientation; wherein die nucleotide sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron: wherein starch 
branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed; and wherein the nucleotide sequence comprises the 
20 sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, 
derivative or homologue thereof, including combinations thereof. 

The term "nucleotide" in relation to the present invention includes DNA and 
RNA. Preferably it means DNA. more preferably DNA prepared by use of recombinant 
DNA techniques. 

25 The term "intron" is used in its normal sense as meaning a segment of 

nucleotides, usually DNA, that is uanscribed but does not encode part or all of an 

expressed protein or enzyme. 

The term "exon" is used in its normal sense as meaning a segment of nucleotides, 

usually DNA, encoding pan or all of an expressed protein or en^me. 
30 Thus, the term intron" refers to gene regions that are transcribed into RNA 

molecules, but which are spliced out of the RNA before the RNA is translated into a 
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protein. In contrast, the term "exon" refers to gene regions that are transcribed into RNA 
and subsequently translated into proteins. 

The terms "variant** or "homologue" or "fragment** in relation to the nucleotide 
sequence of the present invention include ai^ substitution of, variation of, modification 

5 of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
respective nucleotide sequence providii^ the resultant nucleotide sequence can affect 
enzyme activity in a plant, or cell or tissue dieieof, preferably wherein the resultant 
nucleotide sequence has at least the same effect as the complement of the sequence shown 
as SEQ.I.D. No. 38. In particular, the term "homologue** covers homology with respect 

10 to similarity of stnicmre and/or similarity of function providing the resultant nucleotide 
sequence has the ability to affect enzymatic activity in accordance with the present 
invention. With respect to sequence homology (i.e. similarity), preferably there is more 
than 80% homology, more preferably at least 85% homology, more preferably at least 
90% homology, even more preferably at least 95% homology, more preferably at least 

IS 98% homology. The above terms are also synonymous with allelic variations of the 
sequences. 

Likewise, the terms "variant" or "^homologue" or ''fragment" m relation to the 
promoter of the present invention include any substimtion of, variation of, modification 
' of, replacement of, deletion of or addition of one (or more) nucleic acid from or to die 

20 respective promoter sequence providit^ the resultant promoter sequence allows 
expression of a GOI, preferably wherein the resultant promoter sequence has at least the 
same effect as SEQ.I.D. No. 14. In particular, the term "homologue" covers homology 
with respect to similarity of structure and/or similarity of function providing the resultaru 
promoter sequence has the ability to allow for expression of a GOI, such as a nucleotide 

25 sequence according to the present invention. With respect to sequence homology (i.e. 
similarity), preferably diere is more than 80% homology, more preferably at least 85% 
homology, more preferably at least 90% homology, even more preferably at least 95% 
homology, more preferably at least 98% homology. The above terms are also 
synonymous with allelic variations of the sequences. 
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The term "antisense" means a nucleotide sequence that is complementary to, and 
can therefore hybridise with, any one or all of the intron sequences of the present 
invention, including partial sequences thereof. 

With the present invention, the antisense intron can be complementary to an entire 

5 intron of the gene to be inhibited. However, in some circumstances, partial antisense 
sequences may be used (i.e. sequences that are not or do not comprise the fiill 
complementary sequence) providing the partial sequences affect enzymatic activity. 
Suitable examples of partial sequences include sequences that are shorter than the full 
complement of SEQ. ID. No. 38 but which conq)rise nucleotides that are at least 

10 antisense to the sense intron sequences adjacent the respective exon or exons. 

With regard to the second aspect of the present invention (i.e. specifically 
affecting SBE activity), the nucleotide sequences of the present invention may comprise 
one or more sense or antisense exon sequences of the SBE gene, including complete or 
partial sequences thereof, providing the nucleotide sequences can affect SBE activity, 

IS preferably wherein the nucleotide sequences reduce or eliminate SBE aatvity. 
Preferably, the nucleotide sequence of the second aspect of the present invention does not 
comprise an antisense exon sequence. 

The term "vector" includes an expression vector and a transformation vector. The 
term "expression vector" means a construct capable of in vivo or in vitro expression. The 

20 term "transformation vector" means a construct capable of being transferred from one 
species to another • such as from an E.Coii plasmid to a fungus or a plant cell, or from an 
Agrobacterium to a plant cell. 

The term "construct" - which is synonymous with terms such as "conjugate", 
"cassette" and "hybrid" - in relation to ttie antisense nucleotide sequence aspect of die 

25 present invention includes the nucleotide sequence according to the present invention 
directiy or indirectly attached to a promoter. An example of an indirect attachment is the 
provision of a suitable spacer group such as an intron sequence, such as the Shl-iatxon or 
the ADH intron, intermediate the promoter and the nucleotide sequence of the present 
invention. The same is true for the term "fused" in relation to the present invention 

30 which includes direct or indirect attachment. The terms do not cover the namral 
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combination of the wild type SBE gene when associated with the wild type SBE gene 
promoter in their natural environment. 

The construct may even contain or express a marker which allows for ttie 
selection of the genetic construa in, for example, a plant cell into which it has been 
5 transferred. Various markers exist which may be used in, for example* plants - such as 
mannose. Other examples of markers include those that provide for antibiotic resistance - 
e.g. resistance to G418, hygromycin, bleomycin, kanamycin and gentamycin. 

The construct of the present invention preferably comprises a promoter. The term 
"promoter" is used in the normal sense of the art, e.g. an RNA polymerase binding site in 

10 the Jacob-Monod theoiy of gene expression. Examples of suitable promoters are those 
that can direct efficient expression of the nucleotide sequence of the present invention 
and/or in a specific type of cell. Some examples of tissue specific promoters are 
disclosed in WO 92/11375. 

The promoter could additionally include conserved regions such as a Pribnow Box 

IS or a TATA box. The promoters may even contain other sequences to affect (such as to 
maintain, enhance, decrease) the levels of expression of the nucleotide sequence of the 
present invention. Suitable examples of such sequences include the 5/i7-intron or an 
ADH intron. Other sequences include inducible elements - such as temperature, 
chemical, light or stress inducible elements. Also, suitable elements to enhance 

20 transcription or translation may be present. An example of the latter element is the TMV 
5' leader sequence (see Sleat (jene 217 [1987] 217-225; and Dawson Plant Mol. Biol. 23 
[1993] 97), 

As mentioned, the construct and/or the vector of the present invention may 
include a transcriptional initiation region which may provide for regulated or constimtive 
25 expression. Any suitable promoter may be used for the transcriptional initiation region, 
such as a tissue specific promoter. In one aspect, preferably the promoter is the patatin 
promoter or the E35S promoter. In another aspect, preferably the promoter is the SBE 
promoter. 

If, for example, the organism is a plant then the promoter can be one that affects 
30 expression of the nucleotide sequence in any one or more of seed, mber. stem, sprout, 
root and leaf tissues, preferably tuber. By way of example, the promoter for the 
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nucleotide sequence of the present invention can be the a-Amy 1 promoter (otherwise 
loiown as the Amy 1 promoter* the Amy 637 promoter or the a-Amy 637 promoter) as 
described in our co-pending UK patent application No. 9421292.5 filed 21 October 1994. 
Alternatively, the promoter for the nudeoUde sequence of the present invention can be the 

5 a- Amy 3 promoter (otherwise known as the Amy 3 promoter, the Amy 351 promoter or 
the a-Amy 351 promoter) as described in our co*pendiiig UK patent application No. 
9421286.7 filed 21 October 1994. 

The present invention also encompasses the use of a promoter to express a 
nucleotide sequence according to the present invention, wherein a part of the promoter is 

10 ^activated but wherein the promoter can still function as a promoter. Partial inactivation 
of a promoter in some instances is advantageous. In particular, with the Amy 351 
promoter mentioned earlier it is possible to inactivate a pan of it so that the partially 
inactivated promoter expresses the nucleotide sequence of the present invention in a more 
specific manner such as in just one specific tissue type or organ. The term "inactivated" 

IS means partial inactivation in the sense that the expression pattern of the promoter is 
modified but wherein the panially inactivated promoter still functions as a promoter. 
However, as mentioned above, the modified promoter is capable of expressing a gene 
coding for the enzyme of the present invention in at least one (but not all) specific tissue 
of the original promoter. Examples of partial inactivation include altering the foldiiig 

20 pattern of the promoter sequence, or binding species to parts of the nucleotide sequence, 
so that a part of the nucleotide sequence is not recognised by, for example, RNA 
polymerase. Another, and preferable, way of partially inactivating the promoter is to 
truncate it to form fragments thereof. Another way would be to mutate at least a part of 
the sequence so that the RNA polymerase can not bind to that part or another part. 

25 Anodier modification is to mutate the binding sites for regulatory proteins for example the 
CreA protein known from filamentous fimgi to exert carbon catabolite repression, and 
thus abolish the catabolite repression of the native promoter. 

The construct and/or the vector of the present invention may include a 
transcriptional termination region. 

30 The nucleotide according to the present invention can be expressed in combination 

(but not necessarily at the same time) with an additional construct. Thus the present 
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invention also provides a combinadpn of constructs comprising a first construct 
comprising the nucleotide sequence according to the present invention operatively linked 
to a first promoter; and a second construct conq)risix% a GOI operatively linked to a 
second promoter (which need not be die same as the first promoter). With this aspect of 

S the present invention the combination of constructs may be present in the same vector, 
plasmid, cells, tissue, organ or organism. This aspect of the preseru invention also covers 
methods of expressing the same, preferably in specific cells or tissues, such as expression 
in just a specific cell or tissue, of an organism, typically a plant. With this aspect of the 
present invention the second construct does not cover the natural combination of the gene 

10 coding for an enzyme ordinarily associated with the wild type gene promoter when they 
are both in their natural envirorunent. 

An example of a suitable combination would be a first construct comprising the nucleotide 

sequence of the present invention and a promoter^ such as the promoter of the present 
IS invention, and a second construct comprising a promoter, such as the promoter of the 

present invention, and a GOI wherein the GOI codes for another starch branching enzyme 

either in sense or antisense orientation. 

The above conunents relating to the term "construct** for the andsense nucleotide 

aspect of the present invention are equally applicable to the term "construct" for the 
20 promoter aspect of the present invention. In this regard, the term includes the promoter 

according to the present invention direcdy or indirectly attached to a GOI. 

The term "GOI" with reference to the promoter aspect of the present invention or 

the combination aspect of the present invention means any gene of interest, which need 

not necessarily code for a protein or an enzyme - as is explained later. A GOI can be any 
25 nucleotide sequence that is either foreign or natural to the organism in question, for 

example a plant. 

Typical examples of a GOI include genes encoding for other proteins or enzymes 
that modify metabolic and catabolic processes. The GOI may code for an agent for 
introducing or increasing pathogen resistance. 
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The GOI may even be an antisense construct for modifying the expression of 
natural transcripts present in the relevant tissues. An example of such a GOI is the 
nucleotide sequence according to the present invention. 

The GOI may even code for a protein that is non-namral to the host organism - 

5 e.g. a plant. The GOI may code for a compound that is of benefit to animals or humans. 
For example, the GOI could code for a pharmaceutically active protein or enzyme such as 
any one of the therapeutic compounds insulin, interferon, human serum albumin, human 
growth factor and blood clotting factors. The GOI may even code for a protein giving 
additional nutritional value to a food or feed or crop. Typical examples inchide plant 

10 proteins that can inhibit the formation of anti-nutritive factors and plant proteins diat have 
a more desirable amino acid composition (e.g. a higher lysine content than a non- 
transgenic plant). The GOI may even code for an enzyme that can be used in food 
processing such as xylanases and a-galactosidase. The GOI can be a gene encoding for 
any one of a pest toxin, an antisense transcript such as that for a-amylase, a protease or a 

15 glucanase. Alternatively, the GOI can be a nucleotide sequence according to the present 
invention. 

The GOI can be the nucleotide sequence coding for the arabinofiiranosidase 
enzyme which is the subject of our co-pending UK patent application 9505479.7. The 
GOI can be the nucleotide sequence coding for the glucanase enzyme which is the subject 

20 of our co-pending UK patent application 9505475.5. The GOI can be the nucleotide 
sequence coding for the a-amylase enzyme which is the subject of our co-pending UK 
patent application 9413439.2. The GOI can be the nucleotide sequence coding for the a- 
amylase enzyme which is the subject of our co-pending UK patent application 9421290.9. 
The GOI can be any of the nucleotide sequences coding for the a-glucan lyase enzyme 

25 which are described in our co-pending PCT patent application PCT/EP94/03397. 

In one aspect the GOI can even be a nucleotide sequence according to the present 
invention but when operatively linked to a different promoter. 

The GOI could include a sequence that codes for one or more of a xylanase, an 
arabinase, an acetyl esterase, a rfaanmogalacturonase, a glucanase, a pectinase, a 

30 branching enzyme or another carbohydrate modifying enzyme or proteinase. 
Alternatively, the GOI may be a sequence that is antisense to any of those sequences. 
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As mentioned above, the present invention provides a mechanism for selectively 
affecting a particular enzymatic activity. In an important application of the present 
invention it is now possible to reduce or eliminate expression of a genomic nucleotide 
sequence coding for a genomic protein or enzyme by expressing an amisense intron 

5 construct for that particular genomic protein or enzyme and (e.g. at the same time) 
expressing a recombinant version of that enzyme or protein - in other words the GOI is a 
recombinant nucleotide sequence coding for the genomic enzyme or protein. This 
application allows expression of desired recombinant enzymes and proteins in the absence 
of (or reduced levels of) respective genomic enzymes and proteins. < Thus the desired 

10 recombinant enzymes and proteins can be easily separated and purified from the host 
organism. This particular aspect of the present invention is very advantageous over the 
prior art methods which, for example, rely on the use of anti*sense exon expression which 
methods also affea expression of the recombinant enzyme. 

Thus, a iurther aspect of the present bvention relates to a method of expressing a 

15 recombinant protein or enzyme in a host organism comprising expressing a nucleotide 
sequence coding for the recombinant protein or enzyme; and expressii^ a further 
nucleotide sequence wherein the further nucleotide sequence codes, partially or 
completely, for an intron in an antisense orientation; wherein the intron is an intron 
normally associated with the genomic gene encoding a protein or an enzyme 

20 corresponding to the recombinant protein or enzyme; and wherein the further nucleotide 
sequence does not contain a sequence that is antisense to an exon sequence normally 
associated with the intron. Additional aspects cover the combination of those nucleotide 
sequences including their mcorporation in constructs, vectors, cells, tissues and transgenic 
organisms. 

25 Therefore the present invention also relates to a combination of nucleotide 

sequences comprising a first nucleotide sequence coding for a recombinant enzyme; and a 
second nucleotide sequence which corresponds to an intron in aiuisense orientation; 
wherein the intron is an intron that is associated with a genomic gene encoding an enzyme 
corresponding to the recombinant enzyme; and wherein the second nucleotide sequence 

30 does not contain a sequence that is antisense to an exon sequence normally associated with 
the intron. 
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The GOI may even code for one or more introns, such as any one or more of the 
intron sequences presiented in the attached sequence listings. For example, the present 
invention also covers the expression of for exanq>le an antisense intron (e.g. the 
complement of SEQ. ID. No. 38) in combination with for example a sense intron which 
5 preferably is not complementary to the antisense intron sequence (e.g. SEQ.I.D.No. 2 or 
another class A SBE intron). 

The terms "cell", "tissue" and organ" include cell, tissue and organ per se and 

4 

when within an organism. 

The term "organism" in relation to the presem invention includes any organism 

10 that could comprise the nucleotide sequence accordii^ to the present invention and/or 
wherein the nucleotide sequence according to the present invention can be expressed wheii 
present in the organism. Preferably the organism is a starch producii^s organism such as 
any one of a plant, algae, fiuigi, yeast and bacteria, as well as cell lines thereof. 
Preferably the organism is a plaru. 

15 ' The term "starch producing organism" includes any organism that can 
biosyitthesise starch. Preferably, the starch producing organism is a plant. 

The term ''plant" as used herein includes ariy suitable angiosperm, gymnospetm, 
monocotyledon and dicotyledon. Typical examples of suitable plants include vegetables 
such as potatoes; cereals such as wheat, maize, and barley; fruit; trees; flowers; and other 

20 plant crops. Preferably, the term means "potato". 

The term "transgenic organism" in relation to the present invention includes any 
organism that comprises the nucleotide sequence according to the present invention and/or 
products obtained therefrom, and/or wherein the nucleotide sequence accorcUng to the 
present invention can be expressed within the organism. Preferably the nucleotide 

25 sequence of the present invention is incorporated in the genome of the organism. 
Preferably the transgenic organism is a plaiu, more preferably a potato. 

To prepare the host organism one can use prokaryotic or eukaryotic organisms. 
Examples of suitable prokaryotic hosts include £. coU aiKl Bacillus subiilis. Teachings on 
the transformation of prokaryotic hosts is well documented in the art, for example see 

30 Sambrook et al (Sambrook et al. in Molecular Cloning: A Laboratory Manual, 2nd 
edition, 1989, Cold Spring Harbor Laboratory Press). 
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Even though the enzyme according to die present invention and the nucleotide 
sequence coding for same are not disclosed in EP-B-047014S and CA-A-20064S4, those 
two documents do provide some useful background commentary on die ^pes of 
techniques that may be employed to prepare transgenic plants accord'mg to the present 
5 invention. Some of these background teachings are now included in the following 
commentary. 

The basic principle in the construction of genetically modified plants is to insert 
genetic information in the plant genoihe so as to obtain a stable maintenance of the 
inserted genetic material. 

10 Several techniques exist for inserting die genetic information, die two main 

principles being direct introduction of the genetic information and introduction of the 
genetic information by use of a vector system. A review of the general techniques may 
be found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205- 
225) and Christou (Agro-Food-Industry Hi-Tech March/April 1994 17-27). 

15 Thus, in one aspect, the present invention relates to a vector system which carries 

a nucleotide sequence or construct according to the present invention and whidi is capable 
of introducing the nucleotide sequence or construa into the genome of an organism, such 
as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In 
20 the case of two vectors, die veaor system is normally referred to as a binary veaor 
system. Binary vector systems are described in fiirther detail in Gynheung An er a/. 
(1980), Binary Vectors, Plant Molecular Biology Mamal A3, M9. 

One extensively employed system for transformation of plant cells with a given 
promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from 
25 Agrobacierium tumefaciens or a Ri plasmid from Agrobacterium rhizogenes An er a/. 
(1986), Plant PhysioL 81, 301-305 and Butcher D.N. et at. (1980), Tissue CuUure 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Helgeson, 203-208. Several 
different Ti and Ri plasmids have been constructed which are suitable for the construction 
of the plant or plant cell constructs described above. A non-limitii^ example of such a Ti 
30 plasmid is pGV3850. 
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The nucleotide sequence or construct of the present invention should preferably be 
inserted into the Ti-plasmid between the terminal sequences of the T-DNA or adjacent a 
T-DNA sequence so as to avoid disruption of the sequences immediately surrounding the 
T-DNA borders, as at least one of these regions appears to be essential for insertion of 

5 modified T-DNA into the plaiu genome. 

As will be understood from the above explanation, if the organism is a plant the 
vector system of the present invemion is preferably one which contains the sequences 
necessary to infect the plaiu (e.g. the vir region) and at least one border part of a T-DNA 
sequeike, the border part beir^ located on the same vector as the genetic construct. 

10 Funhermore, the vector system is preferably an Agrobaaerium tumefaciens Ti- 

plasmid or an Agrobaaerium rhizpgenes Ri-plasmid or a derivative thereof. As these 
plasmids are well-known and widely employed in the construction of transgenic plants, 
many vector systems exist which are based on these plasmids or derivatives thereof. 

In the construction of a transgenic plant the nucleotide sequence or construct of 

IS the present invention may be first constructed in a microorganism in which the vector can 
replicate and which is easy to manipulate before insertion into the plant. An example of a 
useful microorganism is E. colL but other microorganisms having the above properties 
may be used. When a vector of a vector system as defined above has been constructed in 
E. coll, it is transferred, if necessary, into a suitable Agrobaaerium strain, e.g. 

20 Agrobaaerium tumefaciens. The Ti-plasmid harbouring the nucleotide sequence or 
construct of the present invention is thus preferably transferred into a suitable 
Agrobaaerium strain, e.g. A. (umefacienSy so as to obtain an Agrobaaerium cell 
harbouring the promoter or nucleotide sequence or construa of the present invention, 
which DNA is subsequently transferred into the plant cell to be modified. 

25 If, for example, for the transformation the Ti- or Ri-plasmid of the plant cells is 

used, at least the right boundary and often however the right and the left boundary of the 
Ti- and Ri-plasmid T-DNA, as flanking areas of the introduced genes, can be connected. 
The use of T-DNA for the transformation of plant cells has been intensively studied and is 
described in EP-A-120516; Hoekema, in: The Binary Plant Vector System Offset- 

30 drukkerij Kanters B.B., Alblasserdam, 1985, Chapter V; Fraley, et aL, Crit. Rev. Plant 
Sci., 4:1-46; and An et aL, EMBO J. (1985) 4:277-284. 
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Direct infeaion of plant tissues by Agrobaaerium is a simple technique which has 
been widely employed and which is described in Butcher D.N. et al. (1980), Tissue 
Cubure Methods for Plant Pathologists, eds.: D.S. Ii^rams and J.P. Helgeson. 203-208. 
For further teachings on this topic see Potxykus (Annu Rev Plant Physiol Plant Mol Biol 

S [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/i^ril 1994 17- 
27). With this technique, infection of a plant may be perfonned in or on a certain part or 
tissue of the plant, i.e. on a part of a leaf, a root, a stem or another pan of the plaiu. 

Typically, with direct infection of plant tissues by Agrobacterium carrying the 
GOI (such as the nucleotide sequence accordii^ to the present invention) and, optionally, 

10 a promoter, a plant to be infected is wounded, e.g. by cutting the plant with a razor blade 
or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound is 
then inoculated with the Agrobaaerium, The inoculated plant or plant part is then grown 
on a suitable culture medium and allowed to develop into mature plaius. 

When plant cells are constructed* these ceUs may be grown and maintained in 

15 accordance with well-known tissue culmring methods such as by culturing the cells in a 
suitable culture medium supplied with the necessary growth factors such as amino acids, 
plant hormones, vitamins, etc. 

Regeneration of the transformed cells into genetically modified plants may be 
accomplished using known methods for the regeneration of plants from cell or tissue 

20 cultures, for example by selecting transformed shoots using an antibiotic and by 
subculmring the shoots on a medium containing the appropriate nutrients, plant 
hormones, etc. 

Further teachings on plant transformation may be found in EP-A-0449375. 
As reported in CA-A-2006454, a large amount of cloning vectors are available 
25. which contain a replication system in £. coU and a marker which allows a selection of the 
transformed cells. The vectors contain for example pBR 322, pUC series, M13 mp 
series, pACYC 184 etc. In this way, the nucleotide or construct of the present invention 
can be introduced into a suitable restriction position in the vector. The contained plasmid 
is Uien used for the transformation in Exoli. The E.coli cells are cultivated in a suitable 
30 nutrient medium and then harvested and lysed. The plasmid is then recovered. As a 
method of analysis there is generally used sequence analysis, restriction analysis. 
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electrophoresis and further biochemical-molecular biotogical methods. After each 
manipulation, the used DNA sequence can be restricted and connected with the next DNA 
s^uence. Each sequence can be cloned in the same or different plasmid. 

After the introduction of the nucleotide sequence or construct according to the 
S present invention in the plants the presence and/or insertion of further DNA sequences 
may be necessary - such as to create combination systems as oudined above (e.g. an 
organism comprising a combination of constructs). 

The above commentary for the transformation of prokaryotic organisms and plants 
with the nucleotide sequence of the present invention is equally applicable for the 
10 transformation of those organisms with the promoter of the present invention. 

In summation, the present invention relates to affecting enzyme activity by 
expressing antisense intron sequences. 

Also, the present invention relates to a promoter useful for the expression of those 
antisense intron sequences. 
IS The foUowing samples have been deposited in accordance with the Budapest 

Treaty at the recognised depositary The National Collections of Industrial and Marine 
Bacteria Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scodand, AB2 IRY, 
United Kingdom, on 13 July 1995: 

NCIMB 407S3 (which refers to pBEA 8 as described herein); 
20 NCIMB 40751 (which refers to X-SBE 3.2 as described herein), and 

NCIMB 40752 (which refers to X-SBE 3.4 as described herein). 
The following sample has been deposited in accordance with the Budapest Treaty 
at the recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scotland, AB2 IRY, United 
25 Kingdom, on 9 July 1996: 

NCIMB 40815 (which refers to pBEA 9 as described herein). 
A highly preferred embodiment of the present invention therefore relates to a 
method of affectmg enzymatic activity in a plant (or a cell, a tissue or an organ thereoQ 
comprising expressing in the plant (or a cell, a tissue or an organ thereoQ a nucleotide 
30 sequence wherein the nucleotide sequence codes, partially or completely, for an intron in 
an antisense orientation; wherein the nucleotide sequence does not contain a sequence that 
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is antisense to an exon sequence normally associated with the imron; wherein starch 
branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed; and wherein the nucleotide sequence is antisense to 
intron 1 of class A SBE as set forth in SEQ. ID. No. 38, or any other intron of class A 

5 SBE. including fragments thereof, and including combinations of class A antisense intron 
sequences and class B antisense imron sequences. The sequence of introns of class A 
SBE other than intron 1 may be obtained by sequencing of, for example, potato class A 
SBE genomic DNA, isolatable by hybridisation screening of a genomic DNA library with 
class A SBE cDNA obtainable according to W096/34968 according to methods well 

10 known in the art and set forth, for example, in Sambrook et aL^ Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor. 1989. 

The presem invention will now be described orily by way of example, in which 
reference is made to the following attached Figures: 

Figure 1, wliiich is a schematic representation of the biosynthesis of amylose and 

15 amylopectin; 

Figure 2, which is a diagrammatic representation of the a*l-4-links and the a-1-6 
links of amylopectin; 

Figure 3, which is a diagrannmatic representation of a possible amisense-RNA 
inhibition mechanism; 

20 Figure 4, which is a diagrammatic representation of the exon-intron strucmre^of a 

genomic SBE clone; 

Figure 5, which is a plasmid map of pPATAI, which is 3936 bp in size; 

Figure 6, which is a plasmid map of pABE6, which is 5106 bp in size; 

Figure 7, which is a plasmid map of pVictorlV Man, which is 7080 bp in size; 
25 Figure 8, which is a plasmid map of pBEAS, which is 9.S4 kb in size; 

Figure 9, which is a plasmid map of pBEA9, which is 9.54 kb in size; 

Figure 10, which is a plasmid map of pBEP2, which is 10.32 kb in size; 

Figure 1 1, which is a plasmid map of pVictor5a, which is 9.12 kb in size; 

Figure 12, which shows the full genomic nucleotide sequence for SBE including 
30 the promoter, exons and introns; 
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Figure 13, which shows the positioning of intron 1 in the class A and class B SBE 

genes: 

Figure 14, which shows the sequence of intron 1 of the potato class A SBE; 
Figure 15, which shows the structure of pSS17; and 

5 Figure 16, which shows the sirucnire of pSS18. 

Figures 1 azKl 2 were referred to above in the introduaory description concerning 
starch in general. Figure 3 was referred to above in the introductory description 
concerning antisense expression. 

As mentioned. Figure 4 is a diagranunatic representatioii of the exon-intron 

10 structure of a genomic SBE clone, the sequence of which is shown in Figure 12. This 
clone, which has about 11.5 k base pairs, comprises 14 exons and 13 introns. The 
incrons are numbered in increasing order from the 5' end to the 3* end and correspond to 
SEQ.I.D.No.s 1-13, respectively. Their respective aniisense intron sequences are shown 
as SEQ.I.D.No.s 15-27. 

15 In more detail. Figures 4 and 12 present mformation on the 11478 base pairs of a 

potato SBE gene. The 5* region from nucleotides 1 to 2082 contain the promoter region 
of the SBE gene. A TATA box candidate at nucleotide 2048 to 20S1 is boxed. The 
homology between a potato SBE cDNA clone (Poulsen & Kreiberg (1993) Plant Physiol 
102: 1053-1054) and the exon DNAs begin at 2083 bp and end at 9666 bp. 

20 The homology between the cDNA and the exon DNA is indicated by nucleotides 

in upper case letters, while the translated amino acid sequences are shown in the sii^le 
letter code below the exon DNA. Intron sequences are indicated by lower case letters. 

Figures 5 to 7 are discussed below. As mentioned. Figure 8 is a plasmid map of 
pBEAS, which is 9.54 k base pairs in size; and Figure 9 is a plasmid map of pBEA9, 

25 which is 9.54 k base pairs in size. Each of pBEA 8 and pBEA 9 comprises an antisense 
sequence to the first intron sequence of the potato SBE gene. This first intron sequence, 
which has 1 177 base pairs, is shown in Figure 4 and lies between the first exon and the 
second exon. 

These experiments and aspects of the present invention are now discussed in more 

30 detail. 
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RyPFRTMENTAL PROTOCOL 

ISOLATION, SUBCLONING IN PLASMIDS, AND SEQUENCING OF GENOMIC 
CLASS B SEE CLONES 

S Various clones containing the potato class B SBE gene are isolated from a Desiree 

potato genomic library (Clontech Laboratories Inc., Palo Alto CA» USA) using 
radioactively labelled potato SBE cDNA (Poulsen Sl Kreiberg (1993) Plant Physiol. 
102:1053-1054) as probe. The fragments of the isolated X-phages containing SBE DNA 
(XSBE 3.2 - NCIMB 40751 - and XSBE-3.4 - NCIMB 40752) are identified by Southern 

10 analysis and then subcloned into pBluescript 11 vectors (Clontech Laboratories Inc., Palo 
Alto CA, USA). XSBE 3.2 contains a 15 kb potato DNA insert and XSBE-3.4 contains a 
13 kb potato DNA insert. The resultant plasmids are called pGB3, pGBll, pGB15, 
pGB16 and pGB25 (see discussion below). The respective inserts are then sequenced 
using the Pharmacia Autoread Sequencing Kit (Pharmacia. Uppsala) and a A.L.F. DNA 

15 sequencer (Pharmacia, Uppsala). 

In totaU a stretch of 11.5 kb of the class B SBE gene is sequenced. The sequence 
IS deduced from the above-mentioned plasmids, wherein: pGB25 contains the sequences 
from 1 bp to 836 bp, pGB15 contains the sequences from 735 bp to 2580 bp. pGB16 
contains the sequences from 2580 bp to 5093 bp, pGBll contains the sequences from 

20 3348 bp to 7975 bp, and pGB3 contains the sequences from 7533 bp to 1 1468 bp. 

In more detail, pGB3 is constructed by insertion of a 4 kb EcoRl fragment isolated 
from )iSBE 3.2 into the EcoRL site of pBluescript II SK (+). pGBll is constructed by 
insertion of a 4.7 kb Xhol fraement isolated from XSBE 3.4 into the Xliol site of 
pBluescript II SK (+). pGB15 is constructed by insertion of a 1.7 kb Spel fragment 

25 isolated from A5BE 3.4 into die Spel site of pBluescript 11 SK (+). pGB16 is constructed 
by insertion of a 2.5 kb Spel fragment isolated from XSBE 3.4 into the Spel site of 
pBluescript 11 SK (+). For the construction of pGB25 a PGR fragment is produced with 
the primers 

5' GGA ATT CCA GTC GCA GTC TAG ATT AC 3' 
30 (SEQ. ID. No.30) 

and 
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5' CGG GAT CCA GAG GCA TTA AGA TTT CTG G 3* 

(SEQ. ID. No. 31) 

and XSBE 3.4 as a template. 

The PCR fragment is digested with BamHl and EcoKl, and inserted in pBluescript 
5 II SK (+) digested with the same restriction enzymes. 
A class A SEE clone is derived similarly. 

CONSTTRUCTION OF CLASS B SBE ANTISENSE INTRON PLASMTOS pBEAS 
and pBEA9 

10 The SBE intron 1 is anq)lified by PCR using the oligonucleotides: 

5* CGG GAT CCA AAG AAA TTC TCG AGG TTA CAT GG 3* 

(SEQ. ID. No. 32) 

and 

5' CGG GAT CCG GGG TAA TTT TTA CTA ATT TCA TG 3* 
15 (SEQ. ID. No. 33) 

and the XSBE 3.4 phage containing the SBE gene as template. 

The PCR product is digested with BamHl and inserted in an antisense orientation 
in the BamHl site of plasmid pPATAl (described in WO 94^4292) between the patatin 
promoter and the 35S terminator. This construction, pABE6, is digested with Kpnl, and 
20 the 2.4 kb "patatin promoter-SBE intron 1- 3SS terminator"* Kpnl fragment is isolated and 
inserted in the f^nl site of the plant transformation vector pVictorlV Man. The Kpnl 
fragment is insened in two orientations yielding plasmids pBEA8 and pBEA9. pVictorFV 
Man is shown in Figure 7 and is formed by insertion of a filled in Xbal fragment 
containing a E3SS promoter-manA-3SS terminator casseae isolated from plasmid 
25 pVictorlV SGiN Man (WO 94/24292) into the filled in Xhol site of pVictor IV. The 
pVictor regions of pVictor IV Man contained between the co-ordinates 2.52 bp to 0.32 bp 
(see Figure 7). 



1 
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CONSTRUCTION OF CLASS A SBE ANTISENSE INTRON PLASMIDS pSSl? 
andpSSlS 

Construction of plasmid pSSlT. 

5 The 2122 bp intron 1 sequence of the potato SBEII gene is amplified by PCR 

from a genomic SBEII subclone using the primers 5' - CGG GAT CCC GTA TGT 
CTC ACT GTG TTT GTG GC - 3' (SEQ. ID. No. 34) and 5' - CGG GAT CCC CCT 
ACA TAC ATA TAT CAG ATT AG - 3* (SEQ. ID. No. 35). The PCR product is 
digested with BamHI and inserted in antisense orientation after a patatin promoter in the 

10 BamHI site of a plant transformation vector in which the NPTII gene is used as 
selectable marker (see figure IS). 



Construction of plasmid pSSlS. 

The 2122 bp intron 1 sequence of the potato SBEH gene is amplified by PCR 

15 from a genomic SBEII subclone using the primers 5' - CGG GAT CCC GTA TGT 
CTC ACT GTG TTT GTG GC - 3' (SEQ. ID. No. 34) and 5' - CGG GAT CCC CCT 
ACA TAC ATA TAT CAG ATT AG - 3' (SEQ. ID. No. 35). The PCR product is 
digested with BamHI and inserted in antisense orientation after a patatin promoter in the 
BamHI site of a plant transformation vector in which the manA gene is used as 

20 selectable marker (see figure 16). 

PRODUCTION OF TRANSGENIC POTATO PLANTS 

Axcnic stock cultures 

Shoot cultures of Solanum tuberosum *Bintje* and *Dianella' are maintained on a 
25 substrate (LS) of a formula according to Linsmaier, E.U. and Skoog, F. (1965), Physiol. 
Plant, 18: 100-127, in addition containing 2 jiM silver thiosulphaie at 25°C and 16 h 
light/8 h dark. 

The culmres are subcultured after approximately 40 days. Leaves are then cut off 
the shoots and cut into nodal segments (approximately 0.8 cm) each containing one node. 

30 
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Inoculation of potato tusugs 

Shoots from approximately 40 days old shoot culmres (height approximately 5-6 
cms) are cut into intemodal segments (approximately 0.8 cm). The segments are placed 
into liquid LS-substrate containii^ the transformed Agrobacterium lumefadens containing 
5 the binary vector of interest. The Agrobaaerium are grown overnight in YMB-substrate 
(di*potassium hydrogen phosphate, trihydrate (0.66 g/1); magnesium sulphate, 
heptahydrate (0.20 g/1); sodium chloride (0.10 g/1); mannitol (10.0 g/1); and yeast extract 
(0.40 g/1)) containing appropriate antibiotics (corresponding to the resistance gene of the 
Agrobacterium strain) to an optical density at 660 nm (00-660) of approximately 0.8, 
10 centrifuged and resuspended in the LS-substrate to an OD-660 of O.S. 

The segments are left in the suspension of Agrobaaerium for 30 minutes and then 
the excess of bacteria are removed by bloning the segments on sterile filter paper. 

rn^ulrivation 

IS The shoot segments are co-cultured with bacteria for 48 hours directly on LS- 

substrate containing agar (8.0 g/1), 2, 4-dichlorophenoxy acetic acid (2.0 mg/l) and trans- 
zeatin (0.2 mg/l). The substrate and also the explants are covered with sterile filter 
papers, and the petri dishes are placed at 2S^C and 16 h light/ 8 dark. 

20 "Washing" procedure 

After the 48 h on the co-cultivation substrate the segments are transferred to 
containers containing liquid LS-substrate containing 800 mg/l carbenicillin. The 
containers are gently shaken and by this procedure the major part of the Agrobaaerium is 
either washed off the segments and/or killed. 

25 

After the washing procedure the segments are transferred to plates containing the 
LS-subsffaie, agar (8 g/1), trans-zeatin (1-5 mg/l), gibberellic acid (0.1 mg/I), 
carbenicillin (800 mg/l), and kanamycin sulphate (50-100 mg/l) or phosphinotricin (1-5 
30 mg/l) or mannose (5 g/1) depending on the vector consmiction used. The segments are 
sub-culmred to fresh substrate each 3-4 weeks. 
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In 3 to 4 weeks, shoots develop from the segments and the formation of new 
shoots continued for 3-4 months. 

Rnnrinp of regenerated xhonK 

S The regenerated shoots are transferred to rooting substrate composed of LS- 

substrate. agar (8 g/1) and catbenicillin (800 mg/1). 

The transgenic genotype of the regenerated shoot is verified by testing the rooting 

ability on the above mentioned substrates containing kanamycin sulphate (200 mg/1). by 

performing NPTII assays (Radke, E. et al, Theor. Appl. Genet. (1988), 75: 685-694) 
10 or by performing PCR analysis accordtog to Wang e/ at (1993. NAR 21 pp 4153-4154). 

Plants which are not positive in any of these assays are discarded or used as controls. 

Alternatively, the transgenic plants could be verified by performing a GUS assay on the 

co-introduced p-glucuronidase gene according to Hodal, L. et al. (PL Sci. (1992). 87: 

1 15-122). 

15 

Transfer to soil 

The newly rooted plants (height approx. 2*3 cms) are transplanted from rooting 
substrate to soil and placed in a growth chamber (2rc, 16 hour light 20(M00uE/mVsec). 
When the plants are well established they are transferred to the greenhouse, where they 
20 are grown until tubers had developed and the upper part of the plants are senesciiig. 

Harvesting 

The potatoes are harvested after about 3 months and then analysed. 

25 BRANCHING ENZYME ANALYSIS 

The class A and class B SBE expression in the transgenic potato lines is measured 
using the SBE assays described by Blennow and Johansson (Phytochemistry (1991) 
30:437-444) and by standard Western procedures using antibodies directed against potato 
SBE. 
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STARCH ANALYSIS 

Starch is isolated from potato tubers and analysed for the amylose:amylopectin 
ratio (Hovenkamp-Hermeiink et al. (1988) Potato Research 31:241-246). In addition, the 
chain length distribution of amylopectin is determined by analysis of isoanQ^lase digested 
5 starch on a Dionex HPAEC. 

The number of reducing ends in isoamylase digested starch is determined by the 
method described by N. Nelson (1944) J. Biol.Chem. 153:375-380. 

The results reveal that there is a reduction in the level of synthesis of SBE and/or 
the level of activity of SBE and/or the composition of starch SBE in the transgenic plants. 

10 

CONSTRUCTION OF SBE PROMOTER CONSTRUCT 

An SBE promoter fragment is amplified from X-SBE 3.4 using primers: 
5 ' CCA TCG ATA CTT TAA GTG ATT TGA T(jG C 3* 

(SEQ. ID. No. 36) 

15 and 

5 • CGG gat CCT GTT CTG ATT CTT GAT TTC C 3' . 

(SEQ. ID. No. 37) 

The PGR product is digested with Clal and BamHl. The resultant 1.2 kb fragment is then 
inserted in pVictor5a (see Figure 1 1) linearised with Cldl and BgUl yielding pBEP2 (see 
20 Figure 10). 

CTARCH BRANCHING FN7YMF MKASTTRFMFNTS OF POTATO TUBERS 

Potatoes from potato plants transformed with pBEAS, pBEA9, pSS17 or pSSlS 

are cut in small pieces and homogenised in extraction buffer (50 mM Tris-HCl pH 7.5, 
25 Sodium-dithionite (0.1 g/1). and 2 mM DTT) using a Ultra-Turax homogenizer; 1 g of 

Dowex xl. is added pr. 10 g of tuber. The crude homogenate is filtered through a 

miracloth filter and cenirifuged at 4°C for 10 minutes at 24.7(X) g. The supernatant is 

used for starch branching enzyme assays. 

The starch branching enzyme assays are carried oiu at 25^C in a volume of 400 ^1 
30 composed of 0.1 M Na citrate buffer pH 7.0, 0.75 mg/ml amylose, 5 ix^/ml bovine 

serum albumin and the potato extract. At 0, IS, 30 and 60 minutes aliqouts of SO ^l are 
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removed from the reaction into 20 ^1 3 N HCl. 1 ml of iodine solution is added and the 
decrease in absorbance at 620 nm is measured with an EUSA spectrophotometer. 

The starch branching en^me (SEE) levels are measured in tuber extracts from 34 
transgenic Dianella potato plants transformed with plasmtd pBEAS, pSS17 and pSSlS. 
S The transformed transgenic lines produce tubers which have SEE levels that are 

10% to 15% of the appropriate class A or class B SHE levels found in non transformed 
Dianella plants. 

In a further experiment, plasmids pSS17 and pBEAS are cotransfected into potato 
plants, as described above. In the cotransfectants, when analysed as set fordi above, 
10 simultaneous reduction of class A and class B SBE levels are observed. 

SUMMATION 

The above-mentioned examples relate to the isolation, sequencing and utilisation 
of antisense intron constructs derived from a gene for potato class A and class B SBE. 
IS These SBE intron antisense constructs can be introduced into plants, such as potato 
plants. After introduction, a reduction in the level of synthesis of SBE and/or the level of 
activity of SBE and/or the composition of starch in plants can be achieved. 

Without wishing to be bound by theory it is believed that the expressed anti-sense 
nucleotide sequence of the present invention binds to sense introns on pre-mRNA and 
20 thereby prevents pre-mRNA splicing and/or subsequent translation of mRNA. This 
binding therefore is believed to reduce the level of plant enzyme activity (in particular 
class A and class B SBE activity), which in turn for SBE activity is believed to influence 
the amylose:amylopectin ratio and thus the branching pattern of amylopectin. 

Thus, the present invention provides a method wherein it is possible to manipulate 
25 the starch composition in plants, or tissues or cells thereof, such as potato tubers, by 
reducing the level of SBE activity by using an antisense-RNA techiuque \ismg antisense 

■ 

intron sequences. 

The simultaneous reduction or efmiinaiion of class A and class B SBE sequences 
from the doubly transformed potato plants, moreover, offers the possibility to transform 
30 such plants with different SBE genes at will, thus allowing the manipulation of branching 
in starch accordine to the desired result. 
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Other modifications of the present invention will be apparent to those skilled in the 
art without departing from the scope of the present invention. 

The following pages present a number of sequence listings which have been 
consecutively numbered from SEQ.I.D. No. I - SEQ.LD. No. 38. In brief, SEQ.I.D. 
S No. 1 - SEQ.I.D. No. 13 represent sense imron sequences (genomic DNA); SEQ.I.D. 
No. 14 represents the SBE promoter sequence (genomic sequence); SEQ.LD. No. IS - 
SEQ.I.D. No. 27 represent antisense intron sequences; and SEQ. I.D. No. 28 represents 
is the sequence complementary to the SBE promoter sequence - i.e. the SBE promoter 
sequence in antisense orientation. The full genomic nucleotide sequence for class B SBE 
10 including the proinoter, exons and introns is shown as SEQ. I.D. No. 29 and is explained 
by way of Figures 4 and 12 which highlight particular gene feamres. SEQ. ID. No. 30 to 
37 show primers used in the mediods set forth above. SEQ. ID. No. 38 shows the 
sequence of intron 1 of class A SBE. 
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SEQUENCE LISTING 

(X) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: DANISCO A/S 

(B) STREET: LANGEBROGADE 1 

(C) CITY: COPENHAGEN K 

(E) COUNTRY: DENMARK 

(F) POSTAL CODE (ZIP) ; DK-1001 

(ii) TITLE OF INVENTION: INHIBITION OF GENE EXPRESSION 
(iii) NUMBER OF SEQUENCES: 38 

(iv) COMPUTER READABLE FORM: 

(A> MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC con^atibXe 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patetitln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(XV) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GTAATTTTTA CrTAATTTCAT GTTAATTTCA ATTATTTTTA GCCTTTGCAT TTCATTTTCC 60 

AATATATCTG GATCATCTCC TTAGTTTTTT ATTTTATTTT TTATAATATC AAATATQGAA 120 

GAAAAATGAC ACTTGTAGAG CCATATGTAA GTATCAT6TG ACAAATTT6C AA(3GTGGTT6 180 

AGTGTATAAA ATTCAAAAAT TGAGAGATGG AGGGGGGGT6 GGGGAAGACA ATATTTAGAA 240 

AGAGTGTTCT AGGAGGTTAT GGAGGACACG GATGAGGGGT AGAAGGTTAG TTAGGTATTT 300 

GAGTGTTGTC TGGCTTATCC TTTCATACTA GTAGTCGTGG AATTATTTGG GTAGTTTCTT 360 

GTTTTGTTAT TTGATCTTTG TTATTCTATT TTCTGTTTCT TGTACTTCGA TTATTGTATT 420 

ATATATCTTG TCGTAGTTAT TGTTCCTCGG TAAGAATGCT CTAGCATGCT TCCTTTAGTG 480 
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TTTTATCATG CCTTCTTTAT ATTOGCXSTTG CTTTGAAATG CTTTTACTTT A6CCGAGGGT 
CTATTAGAAA CAATCTCTCT ATCTCGTAAG GTAGGGGTAA AGTCCTCACC ACACTCCACT 

5 

TGTGGGATTA CATTGTGTTT GTTGTTGTAA ATCAATTAT6 TATACATAAT AAGTG6ATTT 
TTTACAACAC AAATACATGG TCAAGGGCAA AGTTCTGAAC ACATAAAGG6 TTCATTATAT 
10 GTCCAGGGAT ATGATAAAAA TTGTTTCTTT GTGAAAGTTA TATAAGATTT GTTATGGCTT 
TTGCTGGAAA CATAATAAGT TATAATGCTG AGATAGCTAC TGAAGTTTGT TTTTTCTAGC 



15 



20 



25 



30 



35 



CTTTTAAATG TACCAATAAT AGATTCCGTA TCGAACGAGT ATGTTTTGAT TACCTGGTCA 
TGATGTTTCT ATTTTTTACA TTTTTTTGGT GTTGAACTGC AATTGAAAAT GTTGTATCCT 
ATGAGACGGA TAGTTGAGAA TGTGTTCTTT GTATGGACCT TGAGAAGCTC AAACGCTACT 

# 

CCAATAATTT CTATGAATTC AAATTCAGTT TATGGCTACC AGTCAGTCCA GAAATTAGGA 
TATGCTGCAT ATACTTGTTC AATTATACTG TAAAATTTCT TAAGTTCTCA AGATATCCAT 
GTAACCTCGA GAATTTCTTT GACAG 
(2) INFORMATION PPR 5EQ ZD NO: 2: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) I^GTK: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: ONA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



540 
600 
660 
720 
780 
640 
900 
960 
1020 
lOBO 
1140 
1165 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
45 GTATGTTTGA TAATTTATAT GGTTGCATGG ATAGTATATA AATAGTTGGA AAACTTCTGG 
ACTGGTGCTC ATGGCATATT TGATCTGTGC ACCGTGTGGA GATGTCAAAC ATGTGTTACT 



50 



TCGTTCGGCC AATTTATAAT ACCTTAACTT GGGAAAGACA GCTCTTTACT CCTGTGGGCA 
TTTGTTATTT GAATTACAAT CTTTATGAGC ATGGTGTTTT CACATTATCA ACTTCTTTCA 
TGTGGTATAT AACAGTTTTT AGCTCCGTTA ATACCTTTCT TCTTTTTGAT ATAAACTAAC 
55 TGTGGTGCAT T6CTTGC 

(2) INFORMATION FOR SEQ ID NO: 3: 



60 
120 
180 
240 
300 
317 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sin9le 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTAACAGCCA AAAGTTGTGC TTTAGGCAGT TTGACCTTAT TTTGGAAGAT GAATTGTTTA 60 
TACCTACTTT GACTTTGCTA GAGAATTTTG CATACCGGGG AGTAAGTAGT GGCTCCATTT 120 
AGGTGGCACC TGGCCATTTT TTTGATCTTT TAAAAAGCTG TTTGATTGGG TCTTCAAAAA 180 
AGTAGACAAG GTTTTTGGAG AAGTGACACA CCCCCGGAGT GTCAGTGGCA AAGCAAA6AT .240 
TTTCACTAAG GAGATTCAAA ATATAAAAAA AGTATAGACA TAAAGAAGCT GAGGGGATTC 300 
AACATGTACT ATACAAGCAT CAAATATAGT CTTAAAGCAA TTTTGTAGAA ATAAAGAAAG ' 360 
TCTTCCTTCT GTTGCTTCAC AATTTCCTTC TATTATCATG AGTTACTCTT TCTGTTOGAA 420 
ATAGCTTCCT TAATATTAAA TTCAT6ATAC TTTTGTTGAG ATTTAGCAGT TTTTTCTTGT 480 
GTAAACTGCT CTCTTTTTTT GCAG 504 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHTUUVCTERISTICS : 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAGGTCCTC GTCTACTACA AAATAGTAGT TTCCATCATC ATAACAGATT TTCCTATTAA 60 
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AGCATGATGT TGCAGCATCA TTGGCTTrCT TACATOTTCT AATTGCTATT AAGGTTATGC 120 
TTCTAATTAA CTCATCCACA ATGCAG 146 
(2) INFORMATION FOR SEQ 123 NO: 5: 

(1) SEQUENCE CBARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTTTTGTTAT TCATACCTTG AAGCTGAATT TTGAACACCA TCATCACAGG CATTTCGATT 60 

CATGTTCTTA CTAGTCTTGT TAT6TAAGAC ATTTTGAAAT GCAAAAGTTA AAATAATTGT 120 

GTCTTTACTA ATTTGGACTT GATCCCATAC TCTTTCCCTT AACAAAATGA GTCAATTCTA 180 

TAAGTGCTTG AGAACTTACT ACTTCAGCAA TTAAACAG 218 
(2) INFORMATION FOR SEQ ID MO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI> SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTATTTTAAA TTTATTTCTA CAACTAAATA ATTCTCAGAA CAATTGTTAG ATAGAATCCA 60 

AATATATACG TCCTGAAAGT ATAAAAGTAC TTATTTTOGC CATGGGCCTT CAGAATATTG 120 

GTAGCCGCTG AATATCATGA TAAGTTATTT ATCCAGTGAC ATTTTTATGT TCACTCCTAT 180 

TATGTCTGCT GGATACAG 198 



wo 98/37213 



34 



PCT/ra98/0Q270 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTTTGTCTGT TTCTATTGCA TTTT A AGGTT CATATAGGTT AGCCACGGAA AATCTCACTC 60 
TTTGTGAGGT AACCAGGGTT CTGATGGATT ATTCAATTTT CTCGTTTATC ATTTGTTTAT 120 
TCTTTTCATG CATTGTGTTT CTTTTTCAAT ATCCCTCTTA TTTGGAGGTA ATTTTTCTCA 180 
TCTATTCACT TTTAGCTTCT AACCACAG 208 
(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(XV) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTATGTCTTA CATCTTTAGA TATTTTGTGA TAATTACAAT TAGTTTGGCT TACTTGAACA 60 
AGATTCATTC CTCAAAATGA CCTGAACTGT TGAACATCAA AGGGGTTGAA ACATAGAGGA 120 
AAACAACATG ATGAATGTTT CCATTGTCTA GGGATTTCTA TTATGTTGCT GAGAAC^UU^T 180 
GTCATCTTAA AAAAAACATT GTTTACTTTT TTGTAGTATA GAAGATTACT GTATAGAGTT 240 
TGCAAGTGTG TCTGTTTTC^G AGTAATTGTG AAATGTTTGA TGAACTTGTA GAG 293 
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(2) INFORMATION FOR SEQ ZD NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
20 GTTCAAGTAT TTTGAATCGC AGCTTGITAA ATAATCTAGT AATTTTTAGA TTGCTTACTT 
GGAAGTCTAC TTGGTTCTGG GGATGATACC TCATTTCATC TTGTTCTACT TATTTTCCAA 



25 



CCGAATTTCT GATTTTTGTT TCGAGATCCA AGTATTAGAT TCATTTACAC TTATTACCGC 
CTCATTTCTA CCACTAAGGC CTTGATGAGC AGCTTAAGTT GATTCTTTGA AGCTATA6TT 
TCAGGCTACC AATCCACAGC CTGCTATATT T6TT6GATAC TTACCTTTTC TTTACAATGA 
30 AGTGATACTA ATTGAAATGG TCTAAATCTG ATATCTATAT TTCTCCGTCT TTCCTCCCCC 
TCATGATGAA ATGCAG 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 
(iv) ANTI-SENSE: NO 



60 
120 
180 
240 
300 
360 
376 



50 



(xa) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GTAAAATCAT CTAAAGTTGA AAGT(jTT(3(3G TTTATGAAGT GCTTTAATTC TATCCAAGGA 
55 CAAC3TAGAAA CCTTTTTACC TTCCATTTCT TGATGATGGA TTTCATATTA TTTAATCCAA 
TAGCTGGTCA AATTCGGTAA TAGCTGTACT GATTAGTTAC TTCACTTTGC AG 



60 
120 
172 
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(2) INFOIUVITION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERZSTZCS : 
5 (A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

15 



(Xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 11: 

20 

GTATATATGT TTTACTTATC CATGAAATTA TTGCTCTGCT TGTTTTTAAT GTACTGAACA 60 

AGTTTTATGG AGAAGTAACT GAAACAAATC ATTTTCACAT TGTCTAATTT AACTCTTTTT 120 

25 TCTGATCCTC GCATGACGAA AACAG 145 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERZSTZCS: 
30 (A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECL'LE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

45 

GTAAGC3ATTT GCTTGAATAA CTTTTGATAA TAA(3ATAACA GATGTAGGGT ACAGTTCTCT 60 
CACCAAAAAG AACTGTTUITT GTCTTCATCCA T C TTT A GTTG TATAAGATAT CCGACTGTCT 120 
50 GAGTTCGGAA GTGTTTGAGC CTCCTGCCCT CCCCCTGCGT TGTTTAGCTA ATTCAAAAAG 180 



GAGAAAACTG TTTATTGATG ATCTTTGTCT TCATGCTGAC ATACAATCTG TTCTCATGAC 240 



AG 242 

55 

(2) INFORMATION FOR SEQ ID NO: 13: 



10 



25 



35 



45 



55 
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(i) SEQUENCE CRARACTERZSTICS : 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTACAGTTCT TGCCGTGTGA CCTCCCTTTT TATTGTGGTT TTGTTCATAG TTATTTGAAT 60 

20 GCGATAGAAG TTAACTATTG ATTACCGCCA CAATCGCCAG TTAAGTCCTC TGAACTACTA 120 

ATTTGAAAGG TAGGAATAGC CGTAATAAGG TCTACTTTTG GCATCTTACT GTTACAAAAC IBO 

AAAAGGATGC CAAAAAAATT CTTCTCTATC CTCTTTTTCC CTAAACCAGT GCATGTAGCT 240 

TGCACCTGCA TAAACTTAGG TAAATOATCA AAAATGAAGT TGATGGGAAC TTAAAACCGC 300 

CCTGAAGTAA AGCTAGGAAT AGTCATATAA TGTCCACCTT TGGTGTCTGC GCTAACATCA 360 

30 ACAACAACAT ACCTCGTGTA GTCCCACAAA GTGGTTTCAG GGGGAGGGTA GAGTGTATGC 420 

AAAACTTACT CCTATCTCAG AGGTAGAGAG GATTTTTTCA ATAGACCCTT GGCTCAAGAA 480 

AAAAAGTCCA AAAAGAAGTA ACAGAAGTGA AAGCAACATG TGTAGCTAAA GCGACCCAAC 540 

TTGTTTGGGA CTGAAGTAGT TGTTGTTGTT GAAACAGTGC ATGTAGATGA ACACATGTCA 600 

GAAAATGGAC AACACAGTTA TTTTGTGCAA GTCAAAAAAA TGTACTACTA TTTCTTTGTG 660 

40 CAGCTTTATG TATAGAAAAG TTAAATAACT AATGAATTTT GCTAGCAGAA AAATAGCTTG 720 

GAGAGAAATT TTTTATATTG AACTAAGCTA ACTATATTCA TCTTTCTTTT TGCTTCTTCT 780 

TCTCCTTGTT TGTGAAG 797 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 
SO (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(lii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: KO 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATCATGGCCA ATTACTGGTT CAAATGCATT ACTTCCTTTC AGATTCTTTC 6AGTTCTCAT 60 

10 GACCGGTCCT ACTACAGACG ATACTAACCC GTGGAACTGT TGCATCTGCT TCTTAGAACT 120 

CTATGGCTAT TTTCGTTAGC TTGGCGTCGG TTTGAACATA GTTTTTGTTT TCAAACTCTT 180 

CATTTACAGT CAAAATGTTG TATGGTTTTT GTTTTCCTCA ATGATGTTTA CAGTGTTGTG 240 

15 

TTGTCATCTG TACTTTTGCC TATTACTTGT TTTGAGTTAC ATGTTAAAAA AGTGTTTATT 300 

TTGCCATATT TTGTTCTCTT ATTATTATTA TCATACATAC ATTATTACAA GGAAAAGACA 360 

20 AGTACACAGA TCTTAACGTT TATGTTCAAT CAACTTTrGG AGGCATTGAC AGGTACCACA 420 

AATTTTGAGT TTATGATTAA GTTCAATCTT AGAATATGAA TTTAACATCT ATTATAGATG 480 

CATAAAAATA GCTAATGATA GAACATTGAC ATTTGGCAGA GCTTAGOGTA TGGTATATCC 540 

25 

AACGTTAATT TAGTAATTTT TGTTACGTAC GTATATGAAA TATTGAATTA ATCACATGAA 600 

CGGTGGATAT TATATTATGA GTTGGCATCA GCAAAATCAT TGGTGTAGTT GACTGTAGTT 660 

30 GCAGATTTAA TAATAAAATG GTAATTAACG GTCGATATTA AAATAACTCT CATTTCAAGT 720 

GGGATTAGAA CTAGTTATTA AAAAAATGTA TACTTTAAGT GATTTGATGG CATATAATTT 780 

AAAGTTTTTC ATTTCATGCT AAAATTGTTA ATTATTGTAA TGTAGACTGC GACTGGAATT 840 

35 

ATTATAGTGT AAATTTATGC ATTCAGTGTA AAATTAAAGT ATTGAACTTG TCTGTTTTAG 900 

AAAATACTTT ATACTTTAAT ATAGGATTTT GTCATGCGAA TTTAAATTAA TCGATATTGA 960 

40 ACACGGAATA CCAAAATTAA AAAGGATACA CATGGCCTTC ATATGAACCG TGAACCTTTG 1020 

ATAACGTGGA AGTTCAAAGA AGGTAAAGTT TAAGAATAAA CTGACAAATT AA7TTCTTTT 1080 

ATTTGGCCCA CTACTAAATT TGCTTTACTT TCTAACATGT CAAGTTGTGC CCTCTTAGTT 1140 

45 

GAATGATATT CATTTTTCAT CCCATAAGTT CAATTTGATT GTCATACCAC CCATGAT6TT 1200 

CTGAAAAATG CTTGGCCATT CACAAAGTTT ATCTTAGTTC CTATGAACTT TATAAGAAGC 1260 

50 TTTAATTTGA CATGTTATTT ATATTAGATG ATATAATCCA TGACCCAATA GACAAGTGTA 1320 

TTAATATTGT AACTTTGTAA TTGAGTGTGT CTACATCTTA TTCAATCATT TAAGGTCATT 1380 

AAAATAAATT ATTTTTTGAC ATTCTAAAAC TTTAAGCAGA ATAAATAGTT TATCAATTAT 1440 

55 

TAAAAACAAA AAACGACTTA TTTATAAATC AACAAACAAT TTTAGATTGC TCCAACATAT 1500 



10 



20 
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TTTTCCAAAT TAAATGCAGA AAATGCATAA TTTTATACTT GATCTTTATA GCTTATTTTT 1560 

TTTAGCCTAA CCAACGAATA TTTGTAAACT CACAACTTGA TTAAAAGGGA TTTACAACAA X620 

5 GATATATATA AGTAGTGACA AATCTTGATT TTAAATATTT TAATTTGGAG GTCAAAATTT 1680 

TACCATAATC ATTTGTATTT ATAATTAAAT TTTAAATATC TTATTTATAC ATATCTAGTA 1740 

AACTTTTAAA TATACGTATA TACAAAATAT AAAATTATTG GCGTTCATAT TAGGTCAATA 1800 

AATCCTTAAC TATATCTGCC TTACCACTAG GAGAAAGTAA AAAACTCTTT ACCAAAAATA 1860 

CATGTATTAT GTATACAAAA AGTCGATTAG ATTACCTAAA TAGAAATTGT ATAACGAGTA 1920 

IS AGTAAGTAGA AATATAAAAA AACTACAATA CTAAAAAAAA TATGTTTTAC TTCAATTTCG 1980 

AAACTAATGG GGTCTGAGTG AAATATTCAG AAAGGGGAGG ACTAACAAAA GGGTCATAAT 2040 

GTTTTTTTAT AAAAAGCCAC TAAAATGAGG AAATCAAGAA TCAGAACATA CAAGAAGGCA 2100 

GCAGCTGAAG CAAAGTACCA TAATTTAATC AATGGAAATT AATTTCAAAG TTTTATCAAA 2160 

ACCCATTCG 2169 
25 (2) INFORMATION FOR SEQ ID NO: 15: • 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE; DMA (genomic) 
35 <iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGTCAAAGA AATTCTCGAG GTTACATGGA TATCTTGAGA ACTTAAGAAA TTTTACAGTA 60 
TAATTGAACA AGTATATGCA GCATATCCTA ATTTCTGGAC TGACTGGTAG CCATAAACTG 120 
AATTTGAATT CATAGAAATT ATTGGAGTAG CGTTTGAGCT TCTCAAGGTC CATACAAAGA 180 
50 ACACATTCTC AACTATCCGT CTCATAGGAT ACAACATTTT CAATTGCAGT TCAACACCAA 240 
AAAAATGTAA AAAATAGAAA CATCATGACC AGGTAATCAA AACATACTCG TTC6ATACGG 300 
AATCTATTAT TGGTACATTT AAAAGGCTAG AAAAAACAAA CTTCAGTAGC TATCTCAGCA 360 
TTATAACTTA TTATGTTTCC AGCAAAAGCC ATAACAAATC TTATATAACT TTCACAAAGA 420 



45 



55 
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20 



30 



35 



40 

AACAATTTTT ATCATATCCC TGGACATATA ATGAACCCTT TATGTGTTCA GAACTTTGCC 
CTTGACCATG TATTTGTGTT GTAAAAAATC CACTTATTAT GTATACATAA TTGATTTACA 
ACAACAAACA CAATGTAATC CCACAAGTGG A6TGTGGTGA GGACTTTACC CCTACCTTAC 
GAGATAGAGA GATTGTTTCT AATAGACCCT CGGCTAAAGT AAAAGCATTT CAAA6CAACG 
CGAATATAAA GAAGGCATGA TAAAACACTA AAGGAAGCAT GCTAGAGCAT TCTTACCGAG 
GAACAATAAC TACGACAAGA TATATAATAC AATAATCGAA GTACAAGAAA CAGAAAATA6 
AATAACAAAG ATCAAATAAC AAAACAAGAA ACTACCCAAA TAATTCCACG ACTACTAG7A 



15 TGAAAGGATA AGCCAGACAA CACTCAAATA CCTAACTAAC CTTCTACCCC TCATCCGTGT 
CCTCCATAAC CTCCTAGAAC ACTCTTTCTA AATATTGTCT TCCCCCACCC CCCCTCCATC 



TCTCAATTTT TGAA7TTTAT ACACTCAACC ACCTTGCAAA TTTGTCACAT GATACTTACA 
-TATGGCTCTA CAAGTGTCAT TTTTCTTCCA TATTTGATAT TATAAAAAAT AAAATAAAAA 
ACTAAGGAGA TGATCCAGAT ATATTGGAAA ATGAAATGCA AAGGCTAAAA ATAATTGAAA 
25 TTAACATGAA ATTAGTAAAA ATTAC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACISRISTICS : 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOUX3Y: linear 

r 

(li) MOLECULE TYPE; DNA (genomic) 
(iii) HYPOTHETICAL: NO. 
(iv) ANTI-SENSE: YES 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1165 



40 



45 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCAAGCAATG CACCACAGTT AGTTTATATC AAAAAGAAGA AAGGTATTAA CGGAGCTAAA 
AACTGTTATA TACCACATGA AAGAAGTTGA TAATGTGAAA ACACCATGCT CATAAAGATT 



50 GTAATTCAAA TAACAAATGC CCACAGGAGT AAAGAGCTGT CTTTCCGAAG TTAAGGTATT 



ATAAATTGGC GGAACGAAGT AACACATGTT TGACATCTCC ACACGGTGCA CAGATCAAAT 
ATGCCATGAG CACCAGTCCA GAAGTTTTCC AACTATTTAT ATACTATCCA TGCAACCATA 
TAAATTATCA AACATAC 



60 
120 
180 
240 
300 
317 
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(2) IKFOKMATION FOR SEQ ID MO: 17: 

(1) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGCAAAAAA AGAGA6CAGT TTACACAAGA AAAAACT6CT AAATCTCAAC AAAAGTATCA 60 
TGAATTTAAT ATTAAGGAAG CTATTTCGAA CA6AAAGAGT AACTCATGAT AATAGAAGGA 120 
AATTGTGAAG CAACAGAAGG AAGACTTTCT TTATTTCTAC AAAATTGCTT TAAGACTATA 180 
TTTGATGCTT GTATAGTACA TGTTGAATCC CCTCAGCTTC TTTATGTCTA TACTTTTTTT 240 
ATATTTTGAA TCTCCTTAGT GAAAATCTTT GCTTTGCCAC TGACACTCCG GGGGTGTGTC 300 
ACTTCTCCAA AAACCTTGTC TACTTTTTTG AAGACCCAAT CAAACAGCTT TTTAAAAGAT ' 360 

•r 

CAAAAAAATG GCCAGGTGCC ACCTAAATGG AGCCACTACT TACTCCCCGG TATGCAAAAT 420 
TCTCTAGCAA AGTCAAAGTA GGTATAAACA ATTCATCTTC CAAAATAAGG TCAAACTGCC 480 
TAAAGCACAA CTTTTGGCTG TTAC 504 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTGCATTGTG GATGAGTTAA TTAGAAGCAT AACCTTAATA GCAATTAGAA CATGTAAGAA 60 
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AGCCAATGAT GCTGCAACAT CATGCTTTAA TAGGAAAATC TGTTATGATG ATGGAAACTA 120 
CTATTTTGTA GTAGACGAGG ACCTAC 146 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTGTTTAATT GCTGAAGTAG TAAGTTCTCA AGCACTTATA GAATTGACTC ATTTTGTTAA 60 
GGGAAAGAGT ATGGGATCAA GTCCAAATTA GTAAAGACAC AATTATTTTA ACTTTTGCAT 120 
TTCAAAATGT CTTACATAAC AAGACTAGTA A6AACATGAA TCGAAATGCC TGTGATGATG 180 
GTGTTCAAAA TTCAGCTTCA AGGTATGAAT AACAAAAC 218 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CTGTATCCAG CAGACATAAT AGGAGT6AAC ATAAAAATGT CACTGGATAA ATAACTTATC 60 

ATGATATTCA GCGGCTACCA ATATTCTGAA GGCCCATGGC GAAAATAAGT ACTTTTATAC 120 

TTTCAGGACG TATATATTTG GATTCTATCT AACAATTGTT CTGAGAATTA TTTAGTTGTA 180 
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GAAATAAATT TAAAATAC 198 
(2) INF0RM;ITI0N FOR SEQ ID NO: 2X: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTGTGGTTAG AAGCTAAAAG TGAATAGATG AGAAAAATTA CCTCCAAATA AGAGGGATAT CO 

TGAAAAAGAA ACACAATGCA TGAAAAGAAT AAACAAATGA TAAACGAGAA AATTGAATAA 120 

TCCATCAGAA CCCTGGTTAC CTCACAAAGA GTGAGATTTT CCGTGGCTAA CCTATATGAA 180 

CCTTAAAATG CAATAGAAAC AGACAAAC 208 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE^ TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTGTACAAGT TCATCAAACA TTTCACAATT ACTCCAAAAC AGACACACTT GCAAACTCTA CO 

TACAGTAATC TTCTATACTA CAAAAAAGTA AACAATGTTT TTTTTAAGAT GACATTTGTT 120 

CTCAGCAACA TAATAGAAAT CCCTAGACAA TGGAAACATT CATCATGTTG TTTTCCTCTA 180 

TGTTTCAACC CCTTTGATGT TCAACAGTTC AGGTCATTTT GAGGAAT(3AA TCTTGTTCAA 240 

GTAAGCCAAA CTAATTGTAA TTATCACAAA ATATCTAAAG ATGTAAGACA TAC 293 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CKftRACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CTGCATTTCA TCATGAGGGG GAGGAAAGAC GGAGAAATAT AGATATCAGA TTTAGACCAT 60 

TTCAATTAGT ATCACTTCAT TGTAAAGAAA AGGTAAGTAT CCAACAAATA TAGCAGGCTG 120 

TGGATT6GTA GCCTGAAACT ATAGCTTCAA AGAATCAACT TAAGCTGCTC ATCAAGGCCT ISO 

TAGTGGTAGA AATGAGGCG6 TAATAAGTGT AAATGAATCT AATACTT6GA TCTCQAAACA 240 

AAAATCAGAA ATTCGGTTGG AAAATAAGTA GAACAAGATG AAATGAGCTA TCATCCCCAG 300 

AACCAAGTAG ACTTCCAAGT AAGCAATCTA AAAATTACTA GATTATTTAA CAAGCTGCGA 360 

TTCAAAATAC TTGAAC 376 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTGCAAAGTG AAGTAACTAA TCAGTACAGC TATTACCGAA TTTGACCAGC TATTGGATTA 60 
AATAATATGA AATCCATCAT CAAGAAATGG AAGGTAAAAA (3GTTTCTACT TGTCCTTGGA 120 
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TAGAATTAAA GCACTTCATA AACCCAACAC TTTCAACTTT AGATGATTTT AC 172 

• I 

(2) IKFORMATIOH FOR SEQ ID NO: 25: . 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: MO 
(iv) ANTI- SENSE; YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTGTTTTCGT CATGCGAGGA TCAGAAAAAA GAGTTAAATT AGACAATGTG AAAATGATTT 60 

GTTTCAGTTA CTTCTCCATA AAACTTGTTC AGTACATTAA AAACAA6CAG AGCAATAATT 120 

TCATGGATAA GTAAAACATA TATAC ' 145 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS I single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:^26: 

CTGTCATGAG AACAGATTGT ATGTCAGCAT GAAGACAAAQ ATCATCAATA AACAGTTTTC 60 

TCCTTTTTGA ATTAGCTAAA CAACGCAGGG GGAGGGCAGG AGGCTCAAAC ACTTCCGAAC 120 

TCAGACAGTC GGATATCTTA TACAACTAAA GATGGATGAG ACAATTACAG TTCTTTTTGG 180 

TGAGAGAACT GTACCCTACA TCTGTTATCT TATTATCAAA AGTTATTCAA GCAAATCCTT 240 

AC 242 
(2) INFORMATION FOR SEQ ID NO: 27: 
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(i) SEQUENCE CKARACTERISTZCS : 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEZ3NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTZC^: SEQ ID NO: 27: 

CTTCACAAAC AAGGAGAAGA AGAAGCAAAA AGAAAGATGA ATATAGTTAG CTTAGTTCAA 60 

TATAAAAAAT TTCTCTCCAA GCTATTTTTC TGCTAGCAAA ATTCATTAGT TATTTAACTT 120 

TTCTATACAT AAAGCTGCAC AAAGAAATAG TA6TACATTT TTTTGACTTG CACAAAATAA 180 

CTGTGTTGTC CATTTTCTGA CATGTGTTCA TCTACATGCA CTGTTTCAAC AACAACAACT 240 

ACTTCAGTCC CAAACAAGTT GGGTCGCTTT AGCTACACAT GTTGCTTTCA CTTCTGTTAC 300 

TTCTTTTTGG ACTTTTTTTC TTGAGCCAAG GGTCTATTGA AAAAATCCTC TCTACCTCTG 360 

AGATAGGAGT AAGTTTTGCA TACACTCTAC CCTCCCCCTG AAACCACTTT GTGGGACTAC 420 

ACGAGGTATG TTGTTGTTGA TGTTAGCGCA GACACCAAAG GTGGACATTA TATGACTATT 480 

CCTAGCTTTA CTTCAGGGCG GTTTTAAGTT CCCATCAACT TCATTTTTGA TCATTTACCT 540 

AAGTTTATGC AGGT6CAAGC TACATGCACT GGTTTAGGGA AAAAGAGGAT AGAGAAGAAT 600 

TTTTTTGGCA TCCTTTTGTT TTGTAACAGT AAGATGCCAA AAGTAGACCT TATTACGGCT 660 

ATTCCTACCT TTCAAATTAG TAGTTCAGAG GACTTAACTG GCGATT6TG6 CGGTAATCAA 720 

TAGTTAACTT CTATCGCATT CAAATAACTA TGAACAAAAC CACAATAAAA AGGGAG6TCA 780 

CACGGCAAGA ACTGTAC 797 
(2} INFORMATION FOR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CGAATGGGT7 TTGATAAAAC TTTGAAATTA ATTTCCATTG ATTAAATTAT GGTACTTTGC 60 

TTCAGCTGCT GCCTTCTTGT ATGTTCTGAT TCTTGATTTC CTCATTTTAG TGGCTTTTTA 120 

TAAAAAAACA TTATGACCCT TTTGTTAGTC CTCCCCTTTC TGAATATTTC ACTCAGACCC 180 

CATTAGTTTC GAAATTGAAG TAAAACATAT TTTTTTTAGT ATTGTAGTTT TTTTATATTT 240 

CTACTTACTT ACTCGTTATA CAATTTCTAT TTAGGTAATC TAATCGACTT TTT6TATACA 300 

TAATACATGT ATTTTTGGTA AAGAGTTTTT TACTTTCTCC TAGTGGTAAG 6CAGATATAG 360 

TTAAGGATTT ATTGACCTAA TATGAACGCC AATAATTTTA TATnTGTAT ATAGGTATAT 420 

TTAAAAGTTT ACTAGATATG TATAAATAAG ATATTTAAAA TTTAATTATA AATACAAAT6 460 

ATTATGGTAA AATTTTGACC TCCAAATTAA AATATTTAAA ATCAAGATTT GTCACTACTT 540 

ATATATATCT TGTTGTAAAT CCCTTTTAAT CAAGTTGTGA GTTTACAAAT ATTCGTTGGT 600 

TAGGCTAAAA AAAATAAGCT ATAAAGATCA AGTATAAAAT TATGCATTTT CTGCATTTAA 660 

TTTGGAAAAA TATGTTGGAG CAATCTAAAA TTGTTTGTTG ATTTATAAAT AAGTCGTTTT 720 

TTGTTTTTAA TAATTGATAA ACTATTTATT CTGCTTAAAG TTTTAGAATG TCAAAAAATA 780 

ATTTATTTTA ATGACCTTAA ATGATTGAAT AAGATGTAGA CACACTCAAT TACAAAGTTA 840 

CAATATTAAT ACACTTGTCT ATTGGGTCAT GGATTATATC ATCTAATATA AATAACATGT 900 

CAAATTAAAG CTTCTTATAA AGTTCATAGG AACTAAGATA AACTTTGTGA ATGGCCAAGC 960 

ATTTTTCA6A ACATCATGGG TGGTATGACA ATCAAATTGA ACTTATGGGA TGAAAAATGA 1020 

ATATCATTCA ACTAAGAGGG CACAACTTGA CATGTTAGAA AGTAAAGCAA ATTTAGTAGT 1080 

GGGCCAAATA AAAGAAATTA ATTTGTCAGT TTATTCTTAA ACTTTACCTT CTTTGAACTT 1140 

CCACGTTATC AAAGGTTCAC GGTTCATATG AAGGCCATGT GTATCCTTTT TAATTTTGGT 1200 

ATTCCGTGTT CAATATCGAT TAATTTAAAT TCGCATGACA AAATCCTATA TTAAAGTATA 1260 

AAGTATTTTC TAAAACAGAC AAGTTCAATA CTTTAATTTT ACACTGAATG CATAAATTTA 1320 

CACTATAATA ATTCCAGTCG CAGTCTACAT TACAATAATT AACAATTTTA GCATGAAATG 1380 

AAAAACTTTA AATTATATGC CATCAAATCA CTTAAAGTAT ACATTTTTTT AATAACTAGT 1440 

TCTAATCCCA CTTGAAATGA GAGTTATTTT AATATCGACC GTTAATTACC ATTTTATTAT 1500 
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TAAATCTGCA ACTACAGTCA ACTACACCAA TGATTTTGCT GATGCCAACT CATAATATAA 1560 

TATCCACCGT TCATGTGATT AATTCAATAT TTCATATACX3 TACGTAACAA AAATTACTAA 1620 

ATTAACGTTG GATATACCAT ACCCTAAGCT CTGCCAAATG TCAATGTTCT ATCATTAGCT 1680 

ATTTTTATGC ATCTATAATA GATGTTAAAT TCATATTCTA AGATTGAACT TAATCATAAA 1740 

CTCAAAATTT GTGGTACCTG TCAATGCCTC CAAAAGTTGA TTGAACATAA ACGTTAAGAT 1800 

CTGTGTACTT GTCTTTTCCT TGTAATAATG TATGTATGAT AATAATAATA AGAGAACAAA 1860 

ATATGGCAAA ATAAACACTT TTTTAACATG TAACTCAAAA CAAGTAATAG GCAAAAGTAC 1920 

AGATGACAAC ACAACACTGT AAACATCATT 6AGGAAAACA AAAACCATAC AACATTTTGA 1980 

CTGTAAATGA AGAGTTTGAA AACAAAAACT ATGTTCAAAC G6ACGCCAAG CTAACGAAAA 2040 

TAGCCATAGA GTTCTAAGAA GCAGATGCAA CAGTTCCA06 GGTTAGTATC GTCTGTAGTA 2100 

GGACCGGTCA TGAGAACTCG AAAGAATCTG AAAGGAAGTA ATGCATTTGA ACCAGTAATT 2160 

GGCCATGAT 2169 

4 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTI<»I: SEQ ID NO: 29: 

ATCATGGCCA ATTACTGGTT CAAATGCATT ACTTCCTTTC AGATTCTTTC GAGTTCTCAT 60 

GACCGGTCCT ACTACAGACG ATACTAACCC GTGGAACTGT TGCATCTGCT TCTTAGAACT 120 

CTATGGCTAT TTTCGTTAGC TTGGCGTCGG TTTGAACATA GTTTTTGTTT TCAAACTCTT 180 

CATTTACAGT CAAAATGTTG TATGGTTTTT GTTTTCCTCA ATGATGTTTA CAGTGTTGTG 240 

TTGTCATCTG TACTTTTGCC TATTACTTGT TTTGAGTTAC ATGTTAAAAA AGTGTTTATT 300 

TTGCCATATT TTGTTCTCTT ATTATTATTA TCATACATAC ATTATTACAA GGAAAAQACA 360 

AGTACACAGA TCTTAACGTT TATGTTCAAT CAACTTTTGG AGGCATTGAC AGGTACCACA 420 
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AATTTTGAGT TTATGATTAA GTTCAATCTT AGAATATGAA TTTAACATCT ATTATAGATG 
CATAAAAATA GCTAATGATA GAACATTGAC ATTTGGCAGA GCTTAGGGTA TGGTATATCC 

5 

AACGTTAATT TAGTAATTTT TGTTAOGTAC GTATATGAAA TATTGAATTA ATCACATGAA 
. CGGTGGATAT TA7ATTATGA GTTGGCATCA GCAAAATCAT TGGTGTAGTT GACTGTAGTT 
10 GCAGATTTAA TAATAAAATG GTAATTAACG GTCGATATTA AAATAACTCT CATTTCAAGT 
GGGATTAGAA CTAGTTATTA AAAAAATGTA TACTTTAAGT GATTTGATGG CATATAATTT 



15 



25 



35 



45 



AAAGTTTTTC ATTTCATGCT AAAATTGTTA ATTATTGTAA TGTAGACTGC GACTGGAATT 
ATTATAGTGT AAATTTATGC ATTCA6TGTA AAATTAAA6T ATTGAACTTG TCTGTTTTAG 
AAAATACTTT ATACTTTAAT ATAGGATTTT GTCATGCGAA TTTAAATTAA TCGATATTGA 



20 ACACGGAATA CCAAAATTAA AAAGGATACA CATGGCCTTC ATATGAACCG TGAACCTTTG 



ATAACGTGGA AGTTCAAAGA AGGTAAAGTT TAAGAATAAA CTGACAAATT AATTTCTTTT 
ATTTGGCCCA CTACTAAATT TGCTTTACTT TCTAACATGT CAAGTT6TGC CCTCTTAGTT 
GAATGATATT CATTTTTCAT CCGATAAGTT CAATTTGATT GTCATACCAC CCATGATGTT 
CTGAAAAATG CTTGGCCATT CACAAAGTTT ATCTTAGTTC CTATGAACTT TATAAGAAGC 



30 . TTTAATTTGA CATGTTATTT ATATTAGATG ATATAATCCA TGACCCAATA GACAAGTGTA 



TTAATATTGT AACTTTGTAA TTGAGTGTGT CTACATCTTA TTCAATCATT TAAGGTCATT 
AAAATAAATT ATTTTTTGAC ATTCTAAAAC TTTAAGCAGA ATAAATAGTT TATCAATTAT 
TAAAAACAAA AAACGACTTA T7TATAAATC AACAAACAAT TTTAGATTGC TCCAACATAT 
TTTTCCAAAT TAAATGCAGA AAATGCATAA TTTTATACTT GATCTTTATA GCTTATTTTT 



40 TTTAGCCTAA CCAACGAATA TTTGTAAACT CACAACTTGA TTAAAAGGGA TTTACAACAA 



GATATATATA AGTAGTGACA AATCTTGATT TTAAATATTT TAATTTGGAG GTCAAAATTT 
TACCATAATC ATTTGTATTT ATAATTAAAT TTTAAATATC TTATTTATAC ATATCTAGTA 
AACTTTTAAA TATACGTATA TACAAAATAT AAAATTATTG GCGTTCATAT TAGGTCAATA 
AATCCTTAAC TATATCTGCC TTACCACTAG GAGAAAGTAA AAAACTCTTT ACCAAAAATA 
50 CATGTATTAT GTATACAAAA AGTCGATTAG ATTACCTAAA TAGAAATTGT ATAACGAGTA 
AGTAAGTAGA AATATAAAAA AACTACAATA CTAAAAAAAA TATGTTTTAC TTCAATTTCG 
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AAACTAA7GG GGTCTGAGTG AAATATTCAG AAAGGGGAGG ACTAACAAAA GGGTCATAAT 
GTTTTTTTAT AAAAAGCCAC TAAAATGAGG AAATCAAGAA TCAGAACATA CAAGAAGGCA 
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50 

GCAGCTGAAG CAAAGTACCA TAATTTAATC AATGGAAATT AATTTCAAAG TTTTATCAAA 2160 

ACCCATTCGA GGATCTTTTC CATCTTTCTC ACCTAAAGTT TCTTCAGGGG TAATTTTTAC 2220 

TAATTTCATG TTAATTTCAA TTATTTTTAG CXTTTTGCATT TCATTTTCCA ATATATCTGG 2280 

ATCATCTCCT TAGTTTTTTA TTTTATTTTT TATAATATCA AATATGGAAG AAAAATGACA 2340 

CTTGTAGAGC CATATGTAAG TATCATGTGA CAAATT7GCA AGGTGGTTGA GTGTATAAAA 2400 

TTCAAAAATT GAGAGATGGA GGGGGGGTGG GGGAAGACAA TATTTAGAAA GAGTGTTCTA 2460 

GGAGGTTATG GAGGACACGG ATGAGGGGTA GAAGGTTAGT TAGGTATTTG AGTGTTGTCT 2520 



15 GGCTTATCCT TTCATACTAG TAGTCGTGGA ATTATTTGGG TAGTTrCTTG TTTTGTTATT 



TGATCTTTGT TATTCTATTT TCTG T TTCTT GTACTTCX5AT TATTGTATTA TATATCTTGT 
CGTAGTTATT GTTCCTCGGT AAGAATGCTC TAGCATGCTT CCTTTAGTGT TTTATCATGC 
CTTCTTTATA TTCGCGTTGC TTTGAAATGC TTTTACTTTA GCCGAGGGTC TATTAGAAAC 
AATCTCTCTA TCTCGTAAGG TAGGGGTAAA GTCCTCACCA CACTCCACTT GTGGGATTAC 



25 ATTGTGTTTG TTGTTGTAAA TCAATTATGT ATACATAATA AGTGGATTTT TTACAACACA 



AATACATGGT CAAGGGCAAA GTTCTGAACA CATAAAGGGT TCATTATATG TCCAGGGATA 
TGATAAAAAT TGTTTCTTTG TGAAAGTTAT ATAAGATTTG TTATGGCTTT TGCTGGAAAC 
ATAATAAGTT ATAATGCTGA GATAGCTACT GAAGTTTGTT TTTTCTAGCC TTTTAAATGT 
ACCAATAATA GATTCCGTAT CGAACGAGTA TGTTTTGATT ACCTGGTCAT GATGTTTCTA 
35 TTTTTTACAT TTTTTTGGTG TTGAACTGCA ATTGAAAATG TTGTATCCTA TGAGACGGAT 
AGTTGAGAAT GTGTTCTTTG TATGGACCTT GAGAAGCTCA AACGCTACTC CAATAATTTC 



TATGAATTCA AATTCAGTTT ATGGCTACCA GTCAGTCCAG AAATTAGGAT ATGCTGCATA 
TACTTGTTCA ATTATACTGT AAAATTTCTT AAGTTCTCAA GATATCCATG TAACCTCGAG 
AATTTCTTTG ACAGGCTTCT AGAAATAAGA TATGTTTTCC TTCTCAACAT AGTACTGGAC 
45 TGAAGTTTGG ATCTCAGGAA CGGTCTTGGG ATATTTCTTC CACCCCAAAA TCAAGAGTTA 
GAAAAGATGA AAGGGTAT6T TTGATAATTT ATATGGTTGC ATGGATAGTA TATAAATA6T 



TGGAAAACTT CTGGACTGGT GCTCATGGCA TATTTGATCT GTGCACCGTG TGGAGATGTC 
AAACATGTGT TACTTCGTTC CGCCAATTTA TAATACCTTA ACTTQGGAAA GACAGCTCTT 
TACTCCTGTG GGCATTTGTT ATTTGAATTA CAATCTTTAT GAGCATGGTG TTTTCACATT 



55 ATCAACTTCT TTCATGTGGT ATATAACAGT TTTTAGCTCC GTTAATACCT TTCTTCTTTT 
TGATATAAAC TAACTGTGGT GCATTGCTTG CATGAAGCAC AGTTCAGCTA TTTCCGCTGT 
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TTTGACCGAT GAC6ACAATT CGACAATGGC ACCCCTAGAG GAAGATGTCA AGACTGAAAA 3900 

TATTGGCCTC CTAAATTTGG ATCCAACTTT GGAACCTTAT CTAGATCACT TCAGACACAG 3960 

AATGAAGAGA TATGTGGATC AGAAAATGCT CATTGAAAAA TATGAGGGAC CCCTTGAGGA 4020 

ATTTGCTCAA GGTAACAGCC AAAAGTTGTG CTTTAGGCAG TTTGACCTTA TTTTGGAAGA 4080 

TGAATTGTTT ATACCTACTT TGACTTTGCT AGAGAATTTT GCATACCGGG GAGTAAGTAG 4140 

TGGCTCCATT TAGGTGGCAC CTGGCCATTT TTTTGATCTT TTAAAAAGCT GTTTGATTGG 4200 

GTCTTCAAAA AAGTAGACAA GGTTTTTGGA GAAGTGACAC ACCCCCGGAG TGTCAGTGGC 4260 

AAAGCAAAGA TTTTCACTAA GGAGATTCAA AATATAAAAA AAGTATAGAC ATAAAGAA6C 4320 

TGAGGGGATT CAACATGTAC TATACAAGCA TCAAATAtAG TCTTAAAGCA ATTTTGTAGA 4380 

AATAAAGAAA GTCTTCCTTC TGTTGCTTCA CAATTTCCTT CTATTATCAT GAGTTACTCT 4440 

TTCTGTTCGA AATAGCTTCC TTAATATTAA . ATTCATGATA CTTTTGTTGA GATTTAGCAG 4500 

TTTTTTCTTG TGTAAACTGC TCTCTTTTTT TGCAGGTTAT TTAAAATTTG 6ATTCAACA6 4560 

GGAAGATGGT TGCATAGTCT ATCGTGAATG GGCTCCTGCT GCTCAGTAGG TCCTCGTCTA 4620 

CTACAAAATA GTAGTTTCCA TCATCATAAC AG A TTTTCCT ATTAAAGCAT GATGTTGCAG 4680 

r 

CATCATTGGC TTTCTTACAT GTTCTAATTG CTATTAAGGT TATGCTTCTA ATTAACTCAT 4740 

CCACAATGCA GGGAAGCAGA AGTTATTGGC GATTTCAATG GATGGAACGG TTCTAACCAC 4800 

ATGATGGAGA AGGACCAGTT TGGTGTTTGG AGTATTAGAA TTCCTGATGT TGACAGTAAG 4860 

CCAGTCATTC CACACAACTC CAGAGTTAAG TTTCGTTTCA AACATGGTAA T6GAGTGTGG 4920 

GTAGATCGTA TCCCTGCTTG GATAAAGTAT GCCACTGCAG ACGCCACAAA GTTTGCAGCA 4980 

CCATATGATG GTGTCTACTG GGACCCACCA CCTTCAGAAA GGTTTTGTTA TTCATACCTT 5040 

GAAGCTGAAT TTTGAACACC ATCATCACAG GCATTTCGAT TCATGTTCTT ACTAGTCTTG 5100 

TTATGTAAGA CATTTTGAAA TGCAAAAGTT AAAATAATTG TGTCTTTACT AATTTGGACT 5160 

TGATCCCATA CTCTTTCCCT TAACAAAATG AGTCAATTCT ATAAGTGCTT GAGAACTTAC 5220 

TACTTCAGCA ATTAAACAGG TACCACTTCA AATACCCTCG CCCTCCCAAA CCCC6AGCCC 5280 

CACGAATCTA TGAAGCACAT GTCGGCATGA GCAGCTCTGA GCCACGTGTA AATTCGTATC 5340 

GTGAGTTTGC AGATGATGTT TTACCTCGGA TTAAGGCAAA TAACTATAAT ACTGTCCAGT 5400 

TGATGGCCAT AATGGAACAT TCTTACTATG GATCATTTGG ATATCATGTT ACAAACTTTT 5460 

TTGCTGTGAG CAGTAGATAT GGAAACCCGG AGGACCTAAA GTATCTGATA GATAAAGCAC 5520 



W09OT7213 

ATAGCTTGGG TTTACAGGTT CTGGTGGATG 
CTGATGGCCT CAATGGCTTT GATATTGGCC 
5 GAGAGCGAGG GTACCATAAG TT6TGGGATA 
TTCTTCGTTT CCTTCTTTCC AACTTGAGGT 
TTCGATTTGA TGGAATAACT TCTATGCTGT 

10 

CAGGAAACTA TAATGAGTAT TTCAGCGAGG 
TGTTGGCCAA TAATCTGATT CACAAGATTT 
15 TTTCTGGTAT GCCGGGCCTT GGCCX3GCCTG 
GCCTGGCAAT GGCAATCCCA GATAAGTGGA 
ATTGGTCCAT GAAGGAAGTA ACATCGAGT7 

20 

TAGCATATGC GGA6ACCCAT GATCAGGTAT 
TCAGAACAAT TGTTAGATAG AATCCAAATA 
25 TTTCGCCATG GGCCTTCAGA ATATTGGTAG 
AGTGACATTT TTATGTTCAC TCCTATTATG 
AGACCATTGC ATTTCTCCTA ATGGACAAAG 

30 

ATGCTTCTCC TGTTGTTGAT CGAG6AATTG 
ATTTTAAGGT TCATATAGGT TAGCCACGGA 
35 TCTGATGGAT TATTCAATTT TCTCGTTTAT 
TCTTTTTCAA TATCCCTCTT ATTTGGAGGT 
TAACCACAfiA TGATCCATTT TTTCACAATG 

40 

ATGGGTAACG AGGTATGTCT TACATCTTTA 
CTTACTTGAA CAAGATTCAT TCCTCAAAAT 
45 AAACATAGAG GAAAACAACA TGATGAATGT 
CTGAGAACAA ATGTCATCTT AAAAAAAACA 
CTGTATAGAG TTTGCAAGTG TGTCTGTTTT 

50 

TACAGTTTGG CCATCCTGAG TGGATTGACT 
ACAAATGTAG ACGCCAGTGG AACCTCGCGG 
55 TATTTTGAAT CGCAGCTTGT TAAATAATCT 
TACTTGGTTC TGGGGATGAT AGCTCATTTC 
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TAGTTCACA6 TCATGCAAGC AATAATX3TCA 5580 

AAGGTTCTCA AGAATCCTAC TTTCATGCTG 5640 

GCAGGCTGTT CAACTATGCC AATTG6GAGG 5700 

G6TGGCTAGA AGA6TATAAC TTT6ACGGAT 5760 

ATGTTCATCA TGGAATCAAT ATGGGATTTA 5820 

CTACAGATGT TGATGCTGTG GTCTATTTAA 5880 

TCCCAGAtGC AACTGTTATT GCCGAAGATG 5940 

TTTCTGAGGG AGGAATTGGT TTTGTTTACC 6000 

TAGATTATTT AAAGAATAAG AATGATGAAG 6060 

TGACAAATAG GAGATA7ACA GAGAAGTGTA 6120 

TTTAAATTTA TTTCTACAAC TAAATAATTC 6180 

TATACGTCCT GAAAGTATAA AAGTACTTAT 6240 

CCGCTGAATA TCATGATAAG TTATTTATCC 6300 

TCTGCTGGAT ACAGTCTATT GTTGGTGACA 6360 

AGATGTATTC TGGCATGTCT TGCTTGACAG 6420 

CGCTTCACAA GGTTTGTCTG TTTCTAT7GC 6480 

AAATCTCACT CTTTGTGAGG TAACCAGGGT 6540 

GATTTGTTTA TTCTTTTCAT GCATTGTGTT 6600 

AATTTTTCTC ATCTATTCAC TTTTAGCTTC 6660 

GCCTTGGGAG GAGAGGGGTA CCTCAATTTC 6720 

GATATTTTGT GATAATTACA ATTAGTTTGG 6780 

GACCTGAACT GTTGAACATC AAAGGGGTTG 6840 

TTCCATTGTC TAGGGATTTC TATTATGTTG 6900 

TTGTTTACTT TTTTGTAGTA TAGAAGATTA 6960 

GGAGTAATTG TGAAATGTTT GATGAACTTG 7020 

TCCCTAGAGA GGGCAATAAT TG6A6TTATG 7080 

ATAGCGAACA CTTGAGATAC AAGGTTCAAG 7140 

AGTAATTTTT AGATTGCTTA CTTGGAAGTC 7200 

ATCTTGTTCT ACTTATTTTC CAACCGAATT 7260 
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TCTGATTTTT GTTTCGAGAT CCAAGTATTA GATTCATTTA CACTTATTAC CGCCTCATTT 7320 

CTACCACTAA G6CCTT6ATG AGCA6CTTAA GTTGATTCTT TGAAGCTATA GTTTCAGGCT 7360 

ACCAATCCAC AGCCTGCTAT ATTTGTTGGA TACTTACCTT TTCTTTACAA TGAAGTGATA 7440 

CTAATTGAAA TGGTCTAAAT CTGATATCTA TATTTCTCOG TCTTTCCTCC CCCTCATGAT 7500 

GAAATGCAGT TTATGAATGC ATTTGATAGA GCTATGAATT CGCTCGATGA AAAGTTCTCA 7560 

TTCCTCGCAT CAGGAAAACA GATAGTAAGC AGCATGGATG ATGATAATAA GGTAAAATCA 7620 

TCTAAAGTTG AAAGTGTTGG GTTTATGAAG TGCTTTAATT CTATCCAAGG ACAAGTAGAA 7680 
ACCTTTTTAC CTTCCATTTC TTGATGATGG ATTTCATATT ATTTAATCCA ATAGCTGGTC . 7740 

AAATTCGGTA ATAGCTGTAC TGATTAGTTA CTTCACTTTG CAGGTTGTTG TGTTTGAACG 7800 

TGGTGACCTG GTATTTGTAT TCAACTTCCA CCCAAAGAAC ACATACGAAG GGTATATATG 7860 

TTTTACTTAT CCATGAAATT ATTGCTCTGC TTGTTTTTAA TGTACTGAAC AAGTTTTATG 7920 

GAGAAGTAAC TGAAACAAAT CATTTTCACA TTGTCTAATT TAACTCTTTT TTCTGATCCT 7980 

CGCATGACGA AAACAGGTAT AAAGTTGGAT GTGACTTGCC AGGGAAGTAC AGAGTTGCAC 8040 

TGGACAGTGA TGCTTGGGAA TTTGGTGGCC ATGGAAGAGT AAGGATTTGC TTGAATAACT 8100 

TTTGATAATA AGATAACAGA T6TAGGGTAC AGTTCTCTCA CCAAAAAGAA CTGTAATTGT 8160 

CTCATCCATC TTTAGriGTA TAAGATATCC GACTGTCTGA GTTCGGAAGT GTTTGAGCCT 8220 

CCTGCCCTCC CCCTGCGTTG TTTAGCTAAT TCAAAAAGGA GAAAACTGTT TATTGATGAT 8280 

CTTTGTCTTC ATGCTGACAT ACAATCTGTT CTCATGACAG ACTGGTCATG ATGTTGACCA 8340 

TTTCACATCA CCAGAAGGAA TACCTGGAGT TCCAGAAACA AATTTCAATG GTCGTCCAAA 8400 

TTCCTTCAAA GTGCTGTCTC CTGCGCGAAC ATGTGTGGTA CAGTTCTTGC CGTGTGACCT 8460 

CCCTTTTTAT TGTGGTTTTG TTCATAGTTA TTTGAATGCX3 ATAGAAGTTA ACTATTGATT 8520 

ACCGCCACAA TCGCCAGTTA AGTCCTCTGA ACTACTAATT TGAAAGGTAG GAATAGCCGT 8580 

AATAAGGTCT ACTTTTGGCA TCTTACTGTT ACAAAACAAA AGGATGCCAA AAAAATTCTT 8640 

CTCTATCCTC TTTTTCCCTA AACCAGTGCA TGTAGCTTGC ACCTGCATAA ACTTAGGTAA 8700 

ATGATCAAAA ATGAAGTTGA TGGGAACTTA AAACCGCCCT GAAGTAAAGC TAGGAATA6T 8760 

CATATAATGT CCACCTTTGG TGTCTGCGCT AACATCAACA ACAACATACC TCGTGTAGTC 8820 

CCACAAAGTG GTTTCAGGGG GAGGGTAGAG TGTATGCAAA ACTTACTCCT ATCTCAGAGG 8880 

TAGAGAGGAT TTTTTCAATA GACCCTTGGC TCAAGAAAAA AAGTCCAAAA AGAAGTAACA 8940 
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GAAGTGAAAG CAACATGT6T AGCTAAAG06 ACCCAACTT6 TTTGGGACT6 AAQTAGTTGT 9000 
TGTTGTTGAA ACAGTGCATG TAGHTGAACA CATGTCAGAA AATGGACAAC ACA6TTATTT 9060 
TGTGCAAGTC AAAA;iAATGT ACTACTATTT CTTT6T6CAG CTTTATGTAT AGAAAAGTTA 9120 
AATAACTAAT GAATTTTGCT AGCAGAAAAA TAGCTTGGAG AGAAATTTTT TATATTGAAC 9180 
TAAGCTAACT ATATTCATCT TTCTTTTTGC TTCTTCTTCT CCTTGTTTGT GAAGGCTTAT 9240 
TACAGAGTTG A7GAACGCAT GTCAGAAACT GAAGATTACC AGACAGACAT TTGTAGTGAG 9300 
CTACTACCAA CAGCCAATAT CGAGGAGAGT GACGAGAAAC TTAAAGATTC GTTATCTACA 9360 
AATATCAGTA ACATTGACGA ACGCATGTCA GAAACTGAAG TTTACCAGAC AGACATTTCT 9420 
AGTGAGCTAC TACCAACAGC CAATATTGAG GAGAGTGACG AGAAACTTAA AGATTCGTTA 9480 
TC7ACAAATA TCAGTAACAT TGATCAGACT GTTGTAGTTT CTGTTGAGGA GAGAGACAAG 9540 
GAAC7TAAAG ATTCACCGTC TGTAAGCATC ATTAGTGATG TTGTTCCAGC TGAATGGGAT 9600 
GATTCAGATG CAAACGTCTG 6GGTGAG6AC ,TAGTCAGATG ATTGATCGAC CCTTCTACCG 9660 
ATTGGTGATC GCTATCCTTG CTCTCTGAGA AATAGGTiSAG 6CGAAACAAA AAATAATTTG 9720 
CATGATAAAA AGTCTGATTT TATGATCGCT ATCCTCGCTC TCTGAGAAAG AAGCGAAACA 9780 
AAGGCGACTC CTGGACTCGA ATCTATAA6A TAACAAAGGC GACTCCTGGG ACTCGAATCT 9840 
ATAAGATAAC AAAGGCAATT CCAAGACTTG AATCTATAAA AAATTTAGTT AAGAATGATT 9900 
AACGTCCGAT CCTAATTCGA ATCGAGGCAT CTTACCACTC CATTGATAAT TATATAAGTC 9960 

AATAAGTCAT ATAAAGTATT AAAAACTAAA TTGACTTGAT CGGTCTATCA AAAATA6ATA 10020 

AATTGTGTTC ATATGTAACA TTTTTGTTGT CACAATTAGC TTAATTACAT CTTTCATGTG 10080 

CAATAACAAA GAAATGATAG GAATTTAGAG ATTCCAATTT TTTTGTTGCC ACAATTAACT 10140 

TAATTACATC TTTCATTTGC AATAACAAAG AAATGATAGG AATTTAGAGA 7CCAG7GTCA 10200 

ATACACAACC TAGGCCAACA TCGAAAGCAT AACTGTAAAC TCATGCATGA AGAAA7CAGT 10260 

CGTAAAAATG AATAAATGCG ACATAAAAAC AAATTGCATG TATCATTAAT GTGACTTAAC 10320 

TACAAGTAAA AATAAATTTA ACAAATGTAA CTTAACTACA AGTAAAAATA AATTGCTTCT 10380 

ATCATTAACA AACAAACAGA ATTAAAAAGA AAAAAACATA CTAAATCTTA CCGTCATTCG 10440 

ATAAAAAAAA ATACCAAATT CATAATGCAA GGAAAAOGAA ACGCGTCCTG ATCGGGTATC 10500 

AACGATGAAA TGGACCAGTT GGATCGACTG CCTGCACAAC GTTAGGTATG CCAAAAAAAA 10560 

GAACACGATC CTTTGCACCC GTTCGATGAT TATCAGTATG TTCACAAAAA AAACTTAAGT 10620 

TCATCCCAGT GTACAACAGC CCCAACATCT GCCCCAAGTA ACAAAAAACA ACCAATTTAT 10680 
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CTTATTCTTA TCTGCCACAA AATAATCGGT TTCACACTAT TCTCTTGTTA TACAAAATTG 10740 
ACAAGTA6GA AGGAGAGGAG TCATCCAAAT AAACGGTGCA CGTTCTTTGA GAAAAGTCTT 10800 
ATTTTTCGTA AGATCCAATT TCAACAAACT TTT C TTCAAS TCAAAATTCC TGATAGTGTA 10860 
TCTCCTCTCG ACGACCTCTT GCATTGAAC6 ATCTCCGCTT ATCATGAAAA GTTGCTTGGA 10920 
TAACAAGTAT TGCAAGGGGG GGACAGTAfiC TATTAAGTTA 6TGGGCCCAA GGAAATGGAG 10980 
GAGTGATAGT CTCGAATATT ATTCACCTCT TTAGCATTAC CCGGTCTGGC TTTAAGGAGT 11040 
TACGTCTTTT ACGCTCGCCA ATTTCTTTTT TTAGAATGGT TGGTGTCAAA ATCGCGAGTT 11100 
GTGGAAGGTT CAAGTTACTC GATTCGTGAT TTTCAAGTAT GAGTGGTGAG AGAGATTCGA 11160 
TATTTTCACG AGGTGTATTC GAGGTCTAGT AGAACGAAGG GTGTCACTAA TGAAAGTTTC 11220 
AAGAGTTCAT CATCATCTTC TTCTAGTAGA TTTTCGCTTT CAAATGAGTA TGAAAATTCT 11280 
TCCTCTTTTC TATTGATTTT CTTCATTGTT TTCTTCATTG TTGTGGTTGT TATT6AAAAG 11340 
AAAGAAAATT TATAACAGAA AAAGATGTCA AAAAAAAGGT AAAATGAAAG AGTATCATAT 11400 
ACTTAAAGAG TTGCGTAGAG ATAAGTCAAA AGAAACAGAA TTATAGTAAT TTCAGCTAAG 11460 
TTAGAATTC 11469 
<2) INFORMATION FOR SEQ ID NO: 30: 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGAATTCCAG TC6CAGTCTA CATTAC 26 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DMA Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGGGATCCAG AGGCATTAAG ATTTCTGG 28 
(2) INFORTIATION FOR SEQ ID NO: 32: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGGGATCCAA AGAAATTCTC GAGGTTACAT GG 32 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Synthetic DNA Primer** 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
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CGGGATCCGG GGTAATTTTT ACTAATTTCA TO 32 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CGGGATCCCG TATGTCTCAC TGTGTTTGTG GC 32 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DNA Primer- 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CGGGATCCCC CTACATACAT ATATCAGATT AG 32 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DNA Primer" 



wo 98/37213 



PCT/IB98/00270 



58 



(iii) HYPOTHETICAL: NO 
(iv) AMTZ -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCATCGATAC TTTAAGTGAT TTGATGGC 28 
(2) INFORMATION FOR SEQ ID NO: 37; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CGGGATCCTG TTCTGATTCT TGATTTCC 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTATGTCTCA CTGTGTTTGT GGCTGTGTGT GTTTTTTTCT CTGTCTTTTT GTGTTTTGTG 
TAATTGGGGC TCTTTAAAGT TGGTATTGTG TATACCCTTT TGAGTATAGT CTTTGAGGAA 



60 
120 
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GCAAAATGAT GAATCTTGAT TGACATTAGT AAGGGTTGTA ACTTTTTGAA GTTTGGTTAC 180 

GTGTAATTGA GTTTGGCTTG TGT6TCTGTG TGTOGAGGTT ATTTTTTTGG TTTGTGTTAT 240 

TGGGGATTCT TAAAAGTIGG TATTGT6TAT ACCCTTTTGA GTATAGTCTT TGAGGAAGCA 300 

AAAATGATGA ATCTTGATTG GCATTAGTAA AOGTTGTAGC TTTTTGAAGT GTGGTTAGGT 360 

GTAATTGAGT TTGGCTTGTG TGTCTGTGTG TTTTGGAATC CTGATGTGTG TCAAGTCCTG 420 

ATATGGGTCG AGGTTCTTTC TTTGGTTTGT GTAATTGGGG GTTCTTAAAA GTTGGTATTA 480 

TGTACCTTTT TAAGAATAGT GTCTGAGAAA GCAAAATCGA TGAATTTTGA TTGACAGCAT 540 

ATTCTTTGAG AAAGCAAAAA ATGGTGAGTT TTCATGGAGA AACTTGATTG ACATTACTAA 600 

AGGTAGCAAC TTTTTCAACT CCTGATATGG GTCAAGGTTC TTTGTTTGGT TTGTGTAATT 660 

TGGGGTTCTT TGAAGTTTTG AGAAAGAAAA ATTATGATTT TTCATGGAGA AATTTGATTT 720 

ACATTAATAA AGGTAGTAGC TTTTTAAAGT GTGGTCAGCT GTAATGAGTT CAGCTTGGTT 780 

TAAAGGGGCC CTACATATGG TGCTTTCTGG TGAGATATTT GTTGCTCCAC CATACGAGTT 840 

ATAAGAATCA TAGTGTTAGG ATCTTTTTTC ri"m ' rrrrr CATTTTTCAC TTGACTAGCT 900 

ACTA6AGGAG TGATCTTGAC GGCGGAAAAT CTTAGAAAGG GGAAGGTTGT TTGCATCAAC 960 

TGGTGTTATA TGTGCAAGGA GACGGGAGAT GATGTAGATC ATCTTCTTCT TCATTGTGGT 1020 

CTTTCCATGA GGTTATGATG TGATATGTTT GAATGGTTTG GTACTTCTTG GCTATGCCAA 1080 

GAACTGTGAA AGAATTGATA TTCA6TTGGA AGTGTGGAGT TGGAAGAGTG GAAGAATTGA 1140 

CACTTGGTTC CATTAGCTTT AATGTGGGTG GTGTCGAGAG AGAGAGAAAT AGGAGAGCTT 1200 

TTGAGGGGGT AGAGTTGAGC TTTCCTCAGT TGAGAAGTAG CCTTTGATAT CrrmT T TT 1260 

TTmTTGTA CACCCATAGA ATTCCCAATT GTATAGAAGA TTGGGTGGAG TTTGTAGAGA 1320 

ATCATCTTTT GTAGTAGATT CTTTACCTTT TGGTATATCC ATTGTATACA GCCAGGCCTT 1380 

TGACTATGTT TAT6AATGAA TATACATTAC TTGAAAAAAA AAGAAGTGAA GCCAGTCTGT 1440 

TGTACCTTTG TAGACAATGT TiSTTGCAGCA TCTTGATAAT TCCCTGAAAA TTGTCTCCCT 1500 

GAAGGAATAG TTTGGTTGAT ATTGATTATT TCTTGGTTTG TTTAATTCGG TGTTCTTGAA 1560 

GGCCATTTTA AATCCTTTGA CATTGTTAAA GGTGTTTACA AGTGTTGGTC TGGGTTTAAA 1620 

AGCACCTCTT GTATGGTGCT TTCTGGAGTG ATCTTTCTTC CTCCAAAAGA GAAGTTGCAA 1680 

GAATCAGTGT GTGTACTTTT TTCTCTTGTA TGATCAGATC TTTTTTCAAT TTTTCCGTTT 1740 

TAGTTGATTT ATCCATATAG TGAAAGTTGG TGTCATAGTT GCTGTTTGTG GACTTCCTGT 1800 

AAAAGTTTTT TGATATACTT AAAAAATTGT CACACAGAAG AAAGAGTTTT TTACCATTAC 1860 
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TTAAGCTAGA TGGGACTGTT TGATTCTTAG ACCAAATAAT GAACCTTTTT GTTCTCTTAA 1920 

CGTGTACTTG AAATAGTTTP GTAAAATTGT GATAGGAAAA AAGATAATTC TTGATTGCTT 1980 

TTGGAGCATC ACTTC7AATC ATAAAA6TCT TTGCTCTCTT CAACCATGAA TGATAAATIG 2040 

GACACTTATG TGGCCCTAAG TTGCTCTCAG TAGTGGTCTT TAATTGTGGA GATATAACTA 2100 

ATCTGATATA TGTATGTAGG GA 2122 
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CLAIMS 

1. A method of afTectihg enzymatic activity in a plant (or a cell, a tissue or an 
organ thereoO comprising expressing in die plant (or a cell, a tissue or an organ thereof) a 

5 nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for 
an intron of a class A potato starch branchii^ enzyme in an antisense orientation, 
optionally together wtdi a nucleotide sequence which codes, partially or completely, for 
an intron of a class B starch branching en^me in an antisense or sense orientation; and 
wherein the nucleotide sequence does not contain a sequence that is antisense to an exon 

10 sequence normally associated with the intron. 

2. A method according to claim 1 wherein starch branching enzyme activity 
is affected and/or wherein the levels of amylopectin are affected and/or the composition 
of starch is changed. 

15 

3. A method of a^cting enzymatic activity in a starch producing organism 
(or a cell, a tissue or an organ thereof) comprising expressing in the starch producing 
organism (or a cell, a tissue or an oigan thereof) a nucleoude sequence wherein the 
nucleotide sequence codes, partially or completely, for an intron of a class A starch 

20 branching enzyme in an antisense orientaUon, optionally together with a nucleotide 
sequence which codes, partially or conq>letely, for an intron of a class B starch branching 
enzyme in an antisense or sense orientation; and wherein starch branching enzyme 
activity is affected and/or the levels of amylopecun are affected and/or the composition of 
starch is changed. 

25 

4. A method accord'mg to claim 3 wherein the nucleotide sequence does not 
contain a sequence that is antisense to an exon sequence normally associated with the 
intron. 

30 S. A method according to any one of the preceding claims wherein the 

enzyn)atic activity is reduced or eliminated. 
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6. A method accoxding to any one of the preceding claims wherein the 
nucleotide sequence codes for at least substantially all of at least one intron in an antisense 
orientation. 

5 

7. A method accorcUng to any one of die preceding claims wherein the 
nucleotide sequence codes for all of at least one intron in an antisense orientation. 

8. A method according to any one of the preceding claims wherein the 
10 nucleotide sequence comprises the complement of SEQ. ID. No. 38, or a fragment 

thereof. 



9. A method according to any one of the preceding claims wherein the 
nucleotide sequence is expressed by a promoter having a sequence shown as SEQ.I.D. 

IS No. 14 or a variant, derivative or homologue thereof. 

10. An antisense sequence comprising the nucleotide sequence as defined in 
claim 8 or a variant, derivative or homologue thereof. 

20 11. A promoter having a sequence shown as SEQ.I.D. No. 14, or a variant, 

derivative or homologue thereof. 

12. A promoter according to claim II in combination with a gene of interest 
("GOD. 

25 

13. A construct capable of comprising or expressing the invention according to 
any one of claims 10 to 12. 

14. A veaor conQ)rising or expressmg the invention according to any oiie of 

■ 

30 claims 10 to 13. 



10 
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15. A combination of nucleotide sequences comprising a Hrst nucleotide 
sequence coding for a recombinant enzyme; and a second nucleotide sequence which 
comspoads to an intron in antisense orientation; wherein the intron is an intron that is 
associated with a genomic gene encoding an enzyme corresponding to the recombinant 
etizyme; and wherein the second imcleodde sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron. 

16. A celU tissue or organ comprising or expressing the invention according to 
any one of claims 10 to 15. 

17. A transgenic starch producing organism coii^)ri$ing or expressing the 
invention according to any one of claims 10 to 16. 



18. A transgenic scaich producing orgam'sm according to. claim 17 wherein the 
15 organism is a plant. 

19. A starch obtained from the invention according to any one of the preceding 

claims. 

20 20. A nucleotide sequence that is antisense to an intron of class A SBE. 

21. A method for modifying starch production in an organism, comprising 
transforming the organism with a transgene capable of expressing an antisense intron 
sequence relating to class A SBE and a transgene capable of expressing an antisense 
25 intron sequence relating to class B SBE, thereby reducing or eliminating endogenous class 
A and class B production, and a funher sequence encoding a SBE from a heterologous 
source. 
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10 



20 



30 



40 



50 



60 



12345678901234 567a<?0I234S6789Q12345678901234567890123456789Q 
ATCATCGCCAATTACTGGTTCAAATGCATTA 

GACCGGTCCTACTACAGACGATACTAACCCGTGGAACTG 

CTATGGCTATTTTCGTTAarrTGGO^ 

OlTrTACAGTCAAAATGTTGTATGGTT^^ 

TTGTCATCTGTACTTlTGCCTATrACTT^^ 

TTGCa^TATTTTC?ITCTCTTATTATr^ 

AGTACACAGATCTTAACGrrXAlXnTC^ 

AATTTTGACrmATGATTAAGTTCAATC^ 

CATAAAAATAGCTAATGATAGAACATTGACAT^^ 

AACGTTAATTTAGTAATTTTTGTTACGTAar^ 

CGGTGGATATTATATTATGAGTTGOCATCAGCAAAATCATT^^ 

GCAGATTTAATAATAAAAIXXn^AATTAACGGTCGATATTAAAATAACTC^^ 

GGGATTAGAACTAGTTATTAAAAAAATGTATACTTTAAGTGATTT^ 

AAAGTITTTCATTTCATGCTAAAATTCTTAAT^^ 

ATrATAGTGTAAATTTATGCATTCAGTGTAAAATTAAAGTATTGAA 

AAAATACTTTATACTTTAATATAGGATTTTGTCATGCG^^ 

ACACGGAATACCAAAATTAAAAAGGATAaVCATGGCXTTCATAT^^ 

ATAACGTGGAAGTTCAAAGAAGGTAAAGTTTAAGAATAAACTt^^ 

ATTT^CCACTACTAA ATTiliL ' nT A C ' mC rA ACATGICAAC iTC l«: CCn ' Cl TAGTT 

I 

GAATGATATTCATTTTrCATCCCATAAGTTCAATTT^ 

CTGAAAAATXXrrTGGCCATTCACAAAGTTTATCTTAGTO 

TTTAATTTGACATGTTATTTATATTAGATGATATAATCC^^ 

TTAATATTGTAACTTTGTAATK^GTGTGT^^ 

AAAATAAATTATTTTTTGACATTCTAAAACTTTAAGCAG^^ 

TAAAAACAAAAAACGACTTATTTATAAATCAACAAACAATTTT^ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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rr rr C CAAATTAAATGCAGAAAATGCATAATTTTATACTr^^ ^ 1560 

TTrAGCCTAACCAACGAATATTTGTAAACTCACAACT^^ 1620 

GATATATATAAGTAGTGACAAATCTTGATTTT^ 1680 

TACCATAATCATTTGTArrrATAATrAAATTnAAATATCTTA^ 1740 

AACTTTTAAATATACCrATATACAAAATATAAAATTArrGa 1800 

AATCOTAACTATATCnXX:CITACCACTAGCAGAAAGTAAA^ 1860 . 

CATGTATTATGTATACAAWAGTCGATTAGATrACCTAAATAGAA^ 1920 

AGTAAGTAGAAATATAAAAAAACTACAATACTAAAAAAAATATCrn^ 1980 

AAiCTAATGGGGTCTGAGTGAAATATTCACIAAAGGGGAGG^ 2040 

GTTTTTXSSSBaAAAGCCACTAAAATGAGGAAAT^^ 2100 

GCAGCTGAAGCAAAGTACCATAATITAATCAATGGAAArrAATT^^ 2160 

MEINFKVLSK 

ACCCATTCGAGGAT LlT^l. C A 'IUm r I^::ACCTAAA GrriC ^ ^ ^ 2220 

PIRGSFPSFSPKVSSG 

taatttcatgtcaacttcaattattttcagcctttgcatttcattttccaatatatctgg 2280 

accatctcctcagtticttcactttattcctcataatatcaaatatggaagaaaaacgaca 2340 

cttgtagagccatacgtaagtatcacgtgacaaacttgcaaggtggctgagtgtataaaa 2400 

ttcaaaaat tgagagatggagggggggtgggggbaragacaat at t: tagaaagagtgttc 24 60 

taggaggccatggaggacacggatgaggggcagaaggttagttaggtatttgagcgttgt 2520 

ccggctcatcctcccataccagtagtcgtggaattatccgggtagcctctcgtcctgtta 2580 

Ctcgatcttcgttattccactctccgtttcttgtactccgattactgtactatacatctt 2640 

gtcgcagttaccgctccccggcaagaacgccctagcatgctccctctagtgctccatcac 2700 

gccccctttatattcgcgccgctttgaaacgctcctactttagccgagggtccactagaa 2760 

acaatctctctatctcgtaaggtaggggtaaagtcctcaccacactccacctgtgggatt 2820 

acattgtgttcgttgctgtaaaccaactatgtatacataacaagtggattttttacaaca 2880 

caaatacatggccaagggcaaagttccgaacacataaagggtccactatacgcccaggga 2940 

catgacaaaaaccgtttctttgtgaaagttatataagatttgttacggcccttgctggaa 3000 
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acacaacaagttataatgccgagacagccaccgaagtttgttccttccagccttttaaac 3060 

gtaccaataatagattccgtatcgaacgagcatgtcttgattacctggtcatgacgtttc 3120 

tattttttacatttttttggtgttgaactgcaattgaaaatgttgtatcccatgagacgg 3180 

atagttgagaacgcgtccctcgtatggacctcgagaagcccaaacgctaccccaataatc 3240 

tctacgaattcaaatccagtttacggctaccagtcagcccagaaatcaggatatgccgca 3300 

tatactcgttcaattatactgtaaaatttcttaagttctcaagatatccatgcaacctcg 3360 

agaatttctttgacagGCTTCrAGAMTAAGATATGTTTTCCT^^ 3420 

ASRNKICFPSQHSTG 

ACTGAAGTTTCGATCTCAGGAACGGTCTTGGGAT^ 3480 

LKFGSQERSWDISSTPKSRV 

TAGAAAAGATGAAAGGgtatgtttgataatccatacggtcgcacggacagcatataaata 3540 

R K D E R 

gttggaaaacctctggaccggtgctcatggcacatttgatctgtgcaccgtgtggagatg 360D 

tcaaacatgtgtcacttcgttccgccaatctataacaccctaacctgggaaagacagctc 3660 

tttactcctgtgggcatttgttatttgaactacaatctttatgagcatggtgtcttcaca 3720 

ttatcaacttctttcatgtggtatataacagtttttagctccgttaatacccttcttctc 3780 

Cttgatataaactaactgtggtgcactgcttgcbk}OcATGAAGCACAGTTCAGCTAT^ 3840 

MKHSSAZS 

CGCTGTTTTGACCGAT^CGACAATTCGACAATGCKIA^ 3900 
AVLTDDDNSTMAPLEEDVKT 

TGAAAATATTGQCCTCCTAAATITOCaiTCCAACTrrGGAAC^^ 3960 
ENIGLLNLOPTLEPYLDHFR 

ACACAGAATC^GAGATATGTGGATCAGAAAATCXrTCJ^^ 4020 
H R M K RYVDQKMLIE KYEGPL 

TGAGGAATTTGCTCAAGgtaacagccaaaagttgtgctttaggcagtctgaccttatttt 4080 
E E F A Q G 

ggaagacgaattgtttacacctactttgactctgctagagaatcttgcacaccggggagt 4140 

aagtagtggctccattcaggcggcacctggccattctttcgatccttcaaaaagctgttt 4200 

gactgggtcttcaaaaaagtagacaaggcctttggagaagtgacacacccccggagtgtc 4260 

agtggcaaagcaaagattc tcaccaaggagat tcaaaatataaaaaaagtatagacataa 4320 

agaagctgaggggatccaacacgtaccacacaagcaccaaatatagccctaaagcaactt 4380 

cgtagaaacaaagaaagtcctccttctgtcgcctcacaatttccccctatcaccatgagc 4440 

tactctttctgttcgaaacagcttcctcaataccaaattcatgatactcccgtcgagatt 4500 
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tagcagtcccctcttgtgtaaaccgctctctttttttgcagCnTATITAAAATTTGG^TT 4560 

Y L K F G ? 

CAACAGGGAAGATGGTTGCATAGTCTATCGTGAATCX^^ 4620 

N REDGCIVYREWAPAAQ 
cgcccaccacaaaatagcagrctccaccatcacaacagatcctcctatcaaagcatgacg 4680 

ttgcagcaccattggcctccctacacgctccaattgctattaaggttacgcttccaatta 4740 

actcatCCacaacgcagGGAAGCAGAAGTTArrGGCGATTTCAATGGATCX^^ 4800 

EAEVIGDFNGWNGS 
AACCACATGATGGAGAAGGACCAGmGGTGTTTGGAGTATTAGAATTC 4860 
NHMM EKDQFG VWSIRIPDVD 
AGTAAGCCAGTCATTCCACACAACTCCAGAGTTAAGm 4920 
SKPVI PHNSRVKFRFKHGNG 
GTCTGGGTAGATCGTATCCCTGCTTGGATAAAGTATGCCACT^^ 4980 
V WV DRI PAWIKYATADATKF 
GCAGCACC^TATGATGGTGTCTACTGGGACCCACCACCTT^^ 5040 
AAPYDGVYWDPPPSER 

taccctgaagctgaattttgaacaccatcaccacaggcatttcgattcatgttcttacca 5100 
gtcttgttatgtaagacattttgaaatgcaaaagttaaaataactgtgtctttactaatt 5160 
tggacttgatcccacactctctcccttaacaaaatgagtcaactctataagtgcccgaga 5220 

actcactactccagcaattaaacagGTACCACTTCAAATACCCTCGCariCC^^ 5280 

YHFKYPRPPKPR 
GAGCCCCACGAATCrATGAAOCACATGTCGGCATGAGCAOCTCT^^ 5340 

APRZ YEAHVGMSSSEPRVNS 
CGTATCGTGAGTrTGCAGATGATGTTTTACCTCGGATT 5400 

YREFADDVLPRZKAKNYNTV 
TCCAGTTGATOGCCATAATGGAACATTCTTACTAT^ 5460 

QLMA IM EHSYYGSF GYHVTN 
ACTTTTTTGCTGTGAGCAGTAGA^ 5520 

FFAVSSRYGNPEDLKYLlbK 
AAGCACATAGCTTGGGTrTACAOrrTCTGC^^ 5580 

AHSLGLQVLVDVVHSHASNN 
ATGTCACTGATGGCCTCAATG GCrr T G ATATTC^ 5640 

VTDG LNGFDIGQGSQBSYFH 
ATGCTGGAGAGCGAGGGTACCATAA G TT G T GG GATAGCAGGCr^^ 5700 

AGERGYHKLWDSRLFNYANW 
GQGAG GVlX^n 'CGTTTC CrnirntX AACrTGAGGT^^ 5760 

EVLRF LLSNLR WWLEEYMFD 
ACGGArnCGATTTGATGGAATAACTTCTATGClCT 5820 

GFRFDGITSMLYVHKGINMG 
GATTrACAGGAAACTATAATGAGTATTTCAGCGAGGCTAC^GAT^^ 5880 

FTGNYNEYFSEATDVDAVVY 
ATTTAATGTTGGCCAATAATCTGAXTCACAAGATTTTCCC^ 5940 

L ML ANNLIHKIFPDATVIAE 
AAGATGTTTCTGGTATGCCGGGCCTTGGaXyyrCT^^ 6000 

DVSGMP GLGRPVSEGGIGFV 
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TTTACTGCCTGGCAATGGCAATCXCAGATAAGTaSATAGA^^ 6060 
YRLAMAIPDKWIDYLKNKND 

ATCy^AGATTGGTCCATGAAGGAACTAACATC 6120 
EDWS MKEVTSS LTNRRYT EK 

ACTGTATAGCATATGCGGAGACCCWSATCAGgtatcttaaacccacttctacaactaaa 6180 

CI AYAETKDQ 

caactctcagaacaactgccagatagaacccaaacacacacgtcccgaaagtataaaagc 6240 

actcatccccgccatgggccttcagaatatcggcagccgctgaacatcacgataagtcat 6300 

ctatccagtgacattcctatgtccactcctactacgtctgctggacacagTCTATTGTTG 6360 

S I V G 

GTGACAAGACCATTGCATTTCTCCTAATCXaCAAAGAGAT^ 6420 
DKTIAFLLMDKEMYSGMSCL 

TGACAGATGCrrCTCCTGTTGTTGATCGAGGAATT^ 6480 
TDASPVVDRGiALHX 

tattgcatcctaaggtccatataggttagccacggaaaaccccactccttgtgaggtaac 6540 

cagggtcctgatggaccattcaattccctcgcctaccacttgtccactcttcccacgcac 6600 

tgtgtttcttttccaatatcccccttattcggaggtaattcctcccatccacccactttc 6660 

agct tec aaccacagATGATCCATTTTTrCACAATGGOTTGG^ 6720 

MIHFFT HALGGEGYL 

AATTTCATGGGTAACGAGgcatgccctacaccctcagatacttcgtgataatcacaacta 6780 
N F M G N E 

gtctggcccacttgaacaagactcactcctcaaaatgacccgaactgtcgaacatcaaag 6840 

gggttgaaacatagaggaaaacaacatgatgaacgcttccattgcccagggacctctact 6900 

atgttgctgagaacaaatgtcaccttaaaaaaaacattgtttacttttttgtagtataga 6960 

agattactgtatagagtttgcaagtgtgtctgttttggagcaattgtgaaatgtttgacg 7020 

aacttgtacagTTTGGCCAtCCTGAGTGGATTGAC^ 7080 

FGHPEWIDFPREGNNWS 

, GTTATGACAAATGTAGAOXrCAGTOGAACCTCGCGGATAGCGAACAC^^ 7140 
YDKC RRQ WNLA DSEHLRYK 

tccaagcaccccgaaccgcagcttgctaaataatctagtaacccctagactgcccacccg 7200 

gaagcccacctggttctggggatgacagctcacttcatcttgttctacttattttccaac 7260 

cgaatttctgacccttgttccgagatccaagtattagattcacccacacttaccaccgcc 7320 

tcatttctaccactaaggccttgatgagcagcccaagttgaccctttgaagctatagttc 7380 

caggctaccaacccacagcctgctatatttgttggatacttacctcctctttacaatgaa 7440 

gtgacactaattgaaatggtccaaatctgacatctatatttctccgtccctcccccccct 7500 
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cacgatgaaacgcagTTTATGAATGCATTTGATAGAGCTATG^^ 7560 

FMNAFDRAKMSLDEK 

rrCTCATTCCTCGCATCVXSAAAACAGATAGTAAGC^^ 7620 
FSPLAS CKQXVSSMDDDMK 

aaaccatctaaagtcgaaagtgttgggcttacgaagtgccctaatcccatccaaggacaa 7680 

gtagaaacctttttaccttccatttctcgacgatggatctcatattatctaatccaatag 7740 

ccggtcaaattcggcaatagctgtactgatcagttacttcactttgcagGTTGTTGTGTT 7800 

V V V F 

TGAACCTGGTGACCTGCSTATrTGTArrCAACTTC 7860 

ERGDLVFVFNFHPKNTYEG 

catatgccttactcatccacgaaactactgccctgcttgtctccaacgcactgaacaagt 7920 

cttatggagaagcaactgaaacaaaccactctcacactgcccaatitcaaccctcctttcc 7980 

gaccctcgcatgacgaaaacagGTATAAAGTTGGATGTGACTTGCCACCX;^^ 8040 

YKVGCDLPGKYRV 

TTGCACTGGACAGTGATGCTTGGGAATnXXnt^ 8 100 

ALDSDAWEFGGHGR 

acaacccttgataacaagacaacagatgtagggtacagttctcccaccaaaaagaaccgc 8160 

aattgccccatccatctctagccgtacaagatatcogactgtctgagttcggaagtgttc 8220 

gagcctcctgccccccccctgcgtcgtttagccaatccaaaaaggagaaaactgttcatt 8280 

gatgatctttgcctccatgctgacacacaatccgctctcatgacagACTGGTCATGATGT 8340 

T G H D V 

TGACCATTTCACATCACCAGAAGGAATACCTGGAGTTCCAGAAAC^^ 8400 
DHFTSPEGIPGV PETNFNGR 

TCCAAATTCCTrCAAAGTGCTGTCrCCTGCGCGAACAT^^ 84 60 
PNSFKVLSPARTCV 

tgacccccccttctattgtggttttgctcacagttat&tgaacgcgacagaagttaacta 8520 

ttgactaccgccacaatcgccagttaagccctctgaactactaatttgaaaggtaggaat 8580 

agccgtaataaggcctacttttggcatcttactgccacaaaacaaaaggatgccaaaaaa 8640 

attcttctctaccctctttttccctaaaccagcgcatgtagcttgcacctgcataaactt 8700 

aggtaaatgatcaaaaatgaagctgacgggaacttaaaaccgccctgaagtaaagccagg 8760 

aatagtcatataacgcccacccctggtgtccgcgctaacatcaacaacaacatacctcgc 8820 

gtagccccacaaagtggtttcagggggagggcagagcgtatgcaaaacccacccctaccc 8880 

cagaggtagagaggaccccttcaatagacccccggctcaagaaaaaaagcccaaaaagaa 8940 

gtaacagaagtgaaagcaacatgtgcagccaaagcgacccaacttgcctgggaccgaagc 9000 
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agtcgctgccgttgaaacagcgcacgcagacgaacacatgtcagaaaacggacaacacag 9060 

tcaccttgtgcaagtcaaaaaaatgtaccactatcccttcgcgcagccctatgtatagaa 9120 

aagttaaataactaacgaattccgctagcagaaaaacagcttggagagaaacttcttata 9180 

ttgaactaagctaaccacattcatcxtcctttctgcttcttcttctccccgtttgtgaag 9240 

GCTTATTACAGAGTTGATGAACGCATGTCAGAAACTGAAG^rr^ 9300 
AYYRVDERM SETEOYQTDIC 

AGTGAGCTACTACCAACAGCCAATATCGAGGAGAGTGACGAGAAACTT^ ^ 9360 
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